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GAB1, A GRB2 BINDING PROTEIN, 
AND COMPOSITIONS FOR MAKING AND METHODS OF USING THE SAME 



FIELD OF THE INVENTION 

The invention relates to the identification and 
5 cloning of the gene that encodes Grb2 associated binder 1 
protein (Gabl) , a protein that is involved in tyrosine kinase 
activation pathways, to isolate Gabl protein and to methods of 
making and using the same. 

BACKGROUND OF THE INVENTION 

10 Signaling by receptor protein tyrosine kinases (RPTKs) 

involves the activation of multiple distinct pathways. Grb2 
is ubiquitously expressed as a 25 kDa protein that plays a 
central role in signaling by several receptors (Lowenstein, 
E.J., et al. (1992) Cell 70:431-442 and Downward, J. (1994) FEBS 

15 Letters 338:113-117) . It functions as an adaptor protein where 
its central SH2 domain binds to an autophosphorylation site on 
the receptor and the two flanking SH3 domains link to effector 
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molecules. One such target is the mammalian homolog of SOS 
which is a guanine nucleotide exchange factor for ras, so that 
Grb2 links receptors with the ras pathway. It is now clear 
that the SH3 domains also link to a variety of other proteins 
5 involved in signaling including Vav (Ren, R., et al . (1994) 
Genes Dev. 8:783-95), c-aJbl (Ye, Z.S., and Baltimore, D. (1994) 
Proc. Natl. Acad. Sci. U.S.A. 91:12629-12633), dynamin (Gout, 
I., et al. (1993) Cell 75:25-36), and SLP-76 (Jackman, J.K., 
et al. (1995) J. Bio. Chem. 270:7029-7032), but several other 

10 binding proteins have been noted during T and B cell signaling 
(Reif, K. et al, (1994) J. Biol. Chem. 269:14081-14087 and 
Motto, D.G., et al. (1994) J. Bio. Chem. 269:21608-21613). 

There is a need to identify additional proteins 
involved in tyrosine kinase activation pathways. There is a 

15 need to isolate proteins involved in tyrosine kinase activation 
pathways, and for compositions and methods for producing and 
isolating proteins involved in tyrosine kinase activation 
pathways. There is a need to isolated nucleic acid molecules 
that encode proteins involved in tyrosine kinase activation 

2 0 pathways. There is a need for compounds which modulate the 
activity of proteins involved in tyrosine kinase activation 
pathways. There is a need for kits and methods of identifying 
such compounds . 

SUMMARY OF THE INVENTION 

25 The invention relates to substantially pure proteins 

that have amino acid sequences shown in SEQ ID NO: 2. 

The invention relates to pharmaceutical compositions 
comprising a protein that has the amino acid sequence shown in 
SEQ ID NC:2 in combination with a pharmaceutically acceptable 

30 carrier. 

The invention relates zo isolated nucleic acid 
molecules that comprise nucleic acid sequences that encode a 
protein that has an amino acid sequence shown in SEQ ID NO: 2. 

The invention relates to pharmaceutical compositions 
35 that comprise nucleic acid molecule that comprise nucleic acid 
sequences that encode a protein that has an amino acid sequence 
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shown in SEQ ID NO: 2 in combination with a pharmaceutical^ 
acceptable carrier. 

The invention relates to isolated nucleic acid 
molecules that consist of SEQ ID NO: 1 or a fragment thereof 
5 having at least 5 nucleotides. 

The invention relates to a recombinant expression 
vector comprising the nucleic acid molecule that has a 
nucleotide sequence that comprises SEQ ID NO:l. 

The invention relates to a host cell comprising a 
10 recombinant expression vector comprising the nucleic acid 
molecule that has a nucleotide sequence that comprises SEQ ID 
NO:l. 

The invention relates to an oligonucleotide molecule 
comprising a nucleotide sequence complimentary to a nucleotide 
15 sequence of at least 5 nucleotides of SEQ ID NO:l. 

The invention relates to isolated antibodies that bind 
to an epitope on SEQ ID NO:2. 

The invention relates to methods of identifying 
substrates, activators or inhibitors of Gabl. 
20 The invention relates to methods and kits for 

identifying compounds that modulate Gabl phosphorylation by 
epidermal growth factor receptor (EGFR) , insulin receptor, 
insulin growth factor 1 (IGF-1) receptor, platelet derived 
growth factor (PDGF) receptor, hepatocyte growth factor (HGF) 
25 receptor, TrkA receptor, IL-3 receptor, B cell receptor, or 
keratinocyte growth factor (KGF) receptor. 

The invention relates to methods and kits for 
identifying compounds that modulate PI -3 kinase protein binding 
to Gabl protein. 

30 The invention relates to methods of inhibiting 

expression of Gabl by contacting cells that express Gabl with 
a nucleic acid molecule that comprises an antisense nucleotide 
sequence that prevents transcription of Gabl gene sequences or 
translation of Gabl mRNA. 

35 The invention relates to non-human transgenic animals 

that comprise a transgene which includes a nucleotide sequence 
that encodes Gabl. 
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The invention relates to non-human transgenic knock 
out animals that comprise a nucleotide sequence within the 
animal's Gabl gene to render the animal incapable of expressing 
functional Gabl. 

5 BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows the amino acid sequence of Gabl. 
Potential tyrosine phosphorylation sites which correspond to 
known motifs for SH2 domains are boxed. These SK2 domains, the 
respective motif and the positions are: Grb2 , Y-X-N-X, (Y 48 ) ; 

10 Nek, Y-D-X-P, (Y 2 < 2 , Y 30 7 , Y 406 ); PLC- n , Y-X-I-P, (Y 30 \ Y 373 , Y 406 ) ; 
PI-3-kinase, Y-X-X-M (Y 447 , Y 472 , Y 589 ) ; and SHPTP2/syp2, Y-X-D-L 
(Y 627 ) . The predicted site for SHPTP2/syp is derived from the 
sequence in IRS-1 which has been shown to bind this SH2 domain. 
The two potential binding sites for the SH3 domains of Grb2 

15 based on the motif P-X-P-X-X-P are shown in bold. The complete 
nucleotide sequence for Gabl will be deposited in GenBank. 

Figure 2 shows a sequence comparison illustrating 
homology between the PH domains of Gabl and IRS-1. Alignment 
between amino acid 14-116 of Gabl with amino acids 13-115 of 

20 human IRS-1 (Araki, E. et al . (1993) DiaJbetes 42:1041-1054). 
Hyphens denote gaps introduced to maximize the alignment, 
colons indicate amino acid identity and periods denote 
similarity. For comparison, the consensus sequence deduced for 
the six conserved subdomains of PH domains (I -VI) is shown 

25 below. 

Figure 3 shows a map depicting the relative 
distribution for predicted serine/threonine and tyrosine 
phosphorylation sites in Gabl compared to those found in human 
IRS-1 . The sites are shown for casein kinase II (S/T-X-X-E/D) , 

30 cAMP dependent kinase (R/K/R/K-X-S/T) , cdc2 kinase (S/T-P-X- 
K/R) , MAP kinase (P-X-S/T-?) , and protein kinase C (S/T-X-R/K) . 
Potential tyrosine phosphorylation sites that correspond to 
predicted binding sites for SH2 domains are shown. Also 
depicted are structural features including the pleckstrin 

35 homology domain (PH domain) , the proline/serine rich regions, 
the bind sites for the SK3 domains of Grb2 (PRS) and the region 
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in IRS-l that binds to phosphotyrosine (Phosphotyrosine 
binding) . 

Figures 4A-4J show results from experiments described 
in Example 2 which investigate Gabl mediation of PI -3 kinase 
5 activity, cell growth and transformation. 

DETAILED DESCRIPTION OF THE INVENTION 

Understanding the molecular mechanisms by which the 
epidermal growth factor (EGF) receptor and insulin receptor 
transmit their signals within the cell are essential to 

10 understanding fundamental processes such as cell growth and 
differentiation as well as diseases such as cancer and 
diabetes. The present invention arises out of the discovery 
of a protein, Gabl (Grb2 associated binder-1) , which is 
involved in receptor protein signalling pathways. Gabl is 

15 involved in transmitting the signals for, but not limited to, 
insulin and epidermal growth factor receptors. 

Several receptor tyrosine kinases such as the EGF 
receptor, the neu/HER-2 receptor, the PDGF receptor, and the 
IGF-l receptor have been shown to either initiate the cell 

20 signalling cascade that leads to tumor formation or to 
accelerate the growth of tumors. These tyrosine kinases 
communicate via a complex network of intracellular proteins. 
In the context of carcinogenic effects, Gabl is a protein that 
is directly phosphorylated by at least the EGF and insulin 

25 receptor. The phosphorylation of Gabl then creates binding 
sites for proteins that are known to result in tumorigenic or 
tumor promoting effects such as PLC-y, PI-3-kinase. and 
SHPTP2/syp. Thus, overexpression of Gabl in tumors may 
facilitate the communication by tyrosine kinases to PLC-t. pi_ 
30 3 -kinase, SHPTP2/syp, etc. and augmenting the tumorigenic or 

tumor promoting effects. 

The action of insulin or. cells is communicated via 
binding to and activation of the insulin receptor, which is 
also a tyrosine kinase. This receptor phosphorylates several 
35 substrates, of which insulin receptor substrate-1 (IRS-D ^ 
the best characterized but not the only substrate. IRS" 1 1S 
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the prototype of the docking protein family to which Gabl also 
belongs. When IRS-1 is phosphorylated by the insulin receptor 
it communicates the signals for the uptake of glucose and the 
initiation of cell division. At least some of the molecules 
5 that bind to IRS-1 are PI -3 -kinase, and SHPTP2/syp, both of 
which have been shown to be essential to the insulin receptor's 
effects . 

The cDNA that encodes human Gabl has been cloned and 
its sequences is shown in SEQ ID N0:1. The amino acid sequence 

10 of the protein is shown in SEQ ID NO: 2 and Figure 1. Gabl was 
discovered when Far Western blots were performed with the SH3 
domains of Grb2 and several bands were detected in glial and 
medulloblastoma tumors whose size did not correspond to any 
known protein. cDNA clones for these proteins were obtained 

15 using recombinant Grb2 to screen an expression cDNA library 
from a human glial tumor. The cDNA that encodes Gabl (Grb2 
associated binder- 1) was isolated, sequenced and the amino acid 
sequence of Gabl protein was predicted. The cDNA that encodes 
murine Gabl has also been cloned and its sequences is shown in 

20 SEQ ID NO: 3. The amino acid sequence of the protein is shown 
in SEQ ID NO:4. 

Analysis of Gabl indicates that it is highly conserved 
across species and that it shares amino acid homology and 
several structural features with IRS-1 (insulin receptor 

25 substrate-1) (See Figures 2 and 3) . As shown in Figures 1 and 
3, similar to IRS-1, Gabl has numerous potential 
phosphorylation sites for tyrosine kinases (Gab 1 has 16 sites 
for tyrosine phosphorylation) and serine/threonine kinases 
(Gabl has 4 7 sites for serine/threonine phosphorylation) . 

3 0 Moreover, Gabl has an N- terminal pleckstrin homology domain 
(Figures 2 and 3) and binding sites for the growth factor 
receptor bound- 2 (Grb2) protein. 

Gabl is tyrosine phosphorylated by, but not limited 
to, both the EGF and insulin receptor and like IRS-1; it can 

35 act as a docking protein for several SH2-proteins . Generation 
of an antibody against Gabl has allowed the observation that, 
following the addition of EGF or insulin, Gabl is 
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phosphorylated and this is accompanied by a change in the 
mobility of the protein on SDS-PAGE. This phosphorylation 
allows other proteins, including but not limited to PI -3 
kinase, SHPTP2/syp, PLC-?, and Grb2 to bind to Gabl . Gabl 
5 represents a new signaling protein in the EGF and insulin 
receptor signaling pathways that could integrate the signals 
from several diverse systems. 

The discovery of Gabl has provided the means to design 
and discover specific modulators such as inhibitors and 

10 activators of this signalling protein. According to the 
present invention, Gabl may be used to screen compounds for 
substrates, inhibitors or activators. Identification of 
substrates is useful to further elucidate signalling pathways. 
Inhibitors are useful to interrupt the signalling pathway and 

15 are therefore useful to treat neoplasms that are characterized 
by overexpressed levels of Gabl. Activators are useful as 
facilitate and enhance Gabl activity and are therefore useful 
to treat diabetes characterized by underexpressed or 
insufficiently functional Gabl. Kits are provided for 

20 screening compounds for Gabl inhibitors. Kits are provided for 
screening compounds for Gabl activators. Kits are provided for 
screening compounds for Gabl substrates. The nucleotide 
sequence that encodes the Gabl is disclosed herein and allows 
for the production of pure protein, the design of probes which 

25 specifically hybridize to nucleic acid molecules that encode 
Gabl and antisense compounds to inhibit transcription of Gabl. 
Anti-Gabl antibodies are provided. Anti-Gabl antibodies may 
be inhibitors of Gabl and may be used in methods of isolating 
pure Gabl and methods of inhibiting Gabl activity. 

30 One source of Gabl inhibitor may be compounds designed 

to resemble portions of Gabl which interfere with signal 
transduction processes emanating from Gabl. The basis for such 
molecules may be either peptides taken directly from the amino 
acid sequence of Gabl, or variants thereof, or small molecules 
35 designed by the aid of computer assisted modeling that would 
resemble parts of Gabl. 
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The present invention provides substantially purified 
human Gabl which has the amino acid sequence consisting of SEQ 
ID NO: 2. Human Gabl can be isolated from natural sources, 
produced by recombinant DNA methods or synthesized by standard 
5 protein synthesis techniques. The present invention provides 
substantially purified murine Gabl which has the amino acid 
sequence consisting of SEQ ID NO: 4. Murine Gabl can be 
isolated from natural sources, produced by recombinant DNA 
methods or synthesized by standard protein synthesis 

10 techniques. Purified Gabl proteins may be used as a research 
reagents to study their activity and to identify compounds 
which modulate their activity. Purified Gabl proteins may also 
be used as antigens for generating hybridoma cell lines that 
produce ant i -Gabl antibodies. 

15 Antibodies which specifically bind to Gabl may be used 

to purify the protein from natural sources using well known 
techniques and readily available starting materials. Such 
antibodies may also be used to purify Gabl from material 
present when producing the protein by recombinant DNA 

20 methodology. The present invention relates to antibodies that 
bind to an epitope which is present on Gabl. As used herein, 
the term "antibody" is meant to refer to complete, intact 
antibodies, and Fab fragments and F(ab) 2 fragments thereof. 
Complete, intact antibodies include monoclonal antibodies such 

25 as murine monoclonal antibodies, chimeric antibodies and 
humanized antibodies. The production of antibodies and the 
protein structures of complete, intact antibodies, Fab 
fragments and F(ab) 2 fragments and the organization of the 
genetic sequences that encode such molecules are well known and 

3 0 are described, for example, in Harlow, E. and D. Lane (198 8) 

< 

ANTIBODIES: A Laboratory Manual, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY. which is incorporated herein by 
reference. Briefly, for example, Gabl protein, or an 
immunogenic fragment thereof, is injected into mice. The 
35 spleen of the mouse is removed, the spleen cells are isolated 
and fused with immortalized mouse cells. The hybrid cells, or 
hybridomas, are cultured and those cells which secrete 
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antibodies are selected. The antibodies are analyzed and, if 
found to specifically bind to Gabl, the hybridoma which 
produces them is cultured to produce a continuous supply of 
antibodies . 

5 Using standard techniques and readily available 

starting materials, a nucleic acid molecules that encode a 
human Gabl protein may be isolated from a human cDNA library, 
using probes or primers which are designed using the nucleotide 
sequence information disclosed in SEQ ID NO:l. Similarly, a 
10 nucleic acid molecules that encode a murine Gabl protein may 
be isolated from a murine cDNA library, using probes or primers 
which are designed using the nucleotide sequence information 
disclosed in SEQ ID NO: 3. 

The present invention relates to an isolated nucleic 
15 acid molecule that comprises a nucleotide sequence that encodes 
human Gabl that comprises the amino acid sequence of SEQ ID 
NO: 2. In some embodiments, nucleic acid molecules consist of 
a nucleotide sequence that encodes human Gabl. In some 
embodiments, the nucleic acid molecules comprise the nucleotide 
20 sequence that consists of the coding sequence in SEQ ID N0:1- 
In some embodiments, the nucleic acid molecules consist of the 
nucleotide sequence set forth in SEQ ID NO:l. The isolated 
nucleic acid molecules of the invention are useful to prepare 
constructs and recombinant expression systems for preparing 

25 human Gabl protein . 

The present invention also relates to an isolated 
nucleic acid molecule that comprises a nucleotide sequence that 
encodes murine Gabl that comprises the amino acid sequence of 
SEQ ID NO: 4. In some embodiments, nucleic acid molecules 

30 consist of a nucleotide sequence that encodes murine Gabl. In 
some embodiments, the nucleic acid molecules comprise the 
nucleotide sequence that consists of the coding sequence in SEQ 
ID NO: 3. In some embodiments, the nucleic acid molecules 
consist of the nucleotide sequence set forth in SEQ ID N0:3. 

3 5 The isolated nucleic acid molecules of the invention are useful 
to prepare constructs and recombinant expression systems for 
preparing murine Gabl protein. 
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A cDNA library, human or murine, may be generated by 
well known techniques. A cDNA clone which contains one of the 
nucleotide sequences set out is identified using probes that 
comprise at least a portion of the nucleotide sequence 
5 disclosed in SEQ ID N0:1 or SEQ ID N0:3. The probes have at 
least 16 nucleotides, preferably 24 nucleotides. The probes 
are used to screen the cDNA library using standard 
hybridization techniques- Alternatively, genomic clones may 
be isolated using genomic DNA from any human cell as a starting 
10 material . 

The present invention relates to nucleic acid 
molecules that hybridize to portions of DNA molecules that 
encode human Gabl . The present invention relates to isolated 
nucleic acid molecules that comprise a nucleotide sequence 

15 identical or complementary to a fragment of SEQ ID NO:l which 
is at least 10 nucleotides. In some embodiments, the isolated 
nucleic acid molecules consist of a nucleotide sequence 
identical or complementary to a fragment of SEQ ID NO:l which 
is at least 10 nucleotides. In some embodiments, the isolated 

20 nucleic acid molecules comprise or consist of a nucleotide 
sequence identical or complementary to a fragment of SEQ ID 
NO:l which is 15-150 nucleotides. In some embodiments, the 
isolated nucleic acid molecules comprise or consist of a 
nucleotide sequence identical or complementary to a fragment 

25 of SEQ ID NO:l which is 15-30 nucleotides. Isolated nucleic 
acid molecules that comprise or consist of a nucleotide 
sequence identical or complementary to a fragment of SEQ ID 
NO:l which is at least 10 nucleotides are useful as probes for 
identifying genes and cDNA sequence having SEQ ID NO:l, PCR 

30 primers for amplifying genes and cDNA having SEQ ID NO:l, and 
antisense molecules for inhibiting transcription and 
translation of genes and cDNA, respectively, which encode Gabl 
having the amino acid sequence of SEQ id NO: 2. 

The present invention relates to nucleic acid 

35 molecules that hybridize to portion of DNA molecules that 
encode murine Gabl. The present invention relates to isolated 
nucleic acid molecules that comprise a nucleotide sequence 
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identical or complementary to a fragment of SEQ ID NO: 3 which 
is at least 10 nucleotides. In some embodiments, the isolated 
nucleic acid molecules consist of a nucleotide sequence 
identical or complementary to a fragment of SEQ ID NO: 3 which 
5 is at least 10 nucleotides. In some embodiments, the isolated 
nucleic acid molecules comprise or consist of a nucleotide 
sequence identical or complementary to a fragment of SEQ ID 
NO: 3 which is 15-150 nucleotides. In some embodiments, the 
isolated nucleic acid molecules comprise or consist of a 
10 nucleotide sequence identical or complementary to a fragment 
of SEQ ID NO: 3 which is 15-30 nucleotides. Isolated nucleic 
acid molecules that comprise or consist of a nucleotide 
sequence identical or complementary to a fragment of SEQ ID 
NO: 3 which is at least 10 nucleotides are useful as probes for 
15 identifying genes and cDNA sequence having SEQ ID NO: 3, PCR 
primers for amplifying genes and cDNA having SEQ ID NO: 3, and 
antisense molecules for inhibiting transcription and 
translation of genes and cDNA, respectively, which encode Gabl 
having the amino acid sequence of SEQ ID N0:4. 
20 The cDNA that encodes human or murine Gabl may be used 

as a molecular marker in electrophoresis assays in which cDNA 
from a sample is separated on an electrophoresis gel and Gabl 
probes are used to identify bands which hybridize to such 
probes. Specifically, SEQ ID NO:l or portions thereof, may be 
25 used as a molecular marker in electrophoresis assays in which 
cDNA from a sample is separated on an electrophoresis gel and 
human Gabl specific probes are used to identify bands which 
hybridize to them, indicating that the band has a nucleotide 
sequence complementary to the sequence of the probes. The 
30 isolated nucleic acid molecule provided as a size marker will 
show up as a positive band which is known to hybridize to the 
probes and thus can be used as a reference point to the size 
of cDNA that encodes human Gabl. Similarly, SEQ ID N0:3 or 
portions thereof, may be used as a molecular marker in 
35 electrophoresis assays in which cDNA from a sample is separated 
on an electrophoresis gel and murine Gabl specific probes are 
used to identify bands which hybridize to them, indicating that 
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the band has a nucleotide sequence complementary to the 
sequence of the probes. The isolated nucleic acid molecule 
provided as a size marker will show up as a positive band which 
is known to hybridize to the probes and thus can be used as a 
5 reference point to the size of cDNA that encodes murine Gabl. 
Electrophoresis gels useful in such an assay include standard 
polyacrylamide gels as described in Sambrook et al., Molecular 
Cloning a Laboratory Manual, Second Ed. Cold Spring Harbor 
Press (1989) which is incorporated herein by reference. 

10 The nucleotide sequence in SEQ ID NO:l may be used to 

design probes, primers and complimentary molecules which 
specifically hybridize to the unique nucleotide sequences of 
human Gabl. The nucleotide sequence in SEQ ID NO: 3 may be used 
to design probes, primers and complimentary molecules which 

15 specifically hybridize to the unique nucleotide sequences of 
murine Gabl. Probes, primers and complimentary molecules which 
specifically hybridize to nucleotide sequence that encodes 
human or murine Gabl may be designed routinely by those having 
ordinary skill in the art. 

20 The present invention also includes labeled 

oligonucleotides which are useful as probes for performing 
oligonucleotide hybridization methods to identify human or 
murine Gabl genes and cDNA. Accordingly, the present invention 
includes probes that can be labelled and hybridized to unique 

25 nucleotide sequences of human or murine Gabl. The labelled 
probes of the present invention are labelled with radiolabeled 
nucleotides or are otherwise detectable by readily available 
nonradioactive detection systems. In some preferred 

embodiments, probes comprise oligonucleotides consisting of 

30 between 10 and 100 nucleotides. In some preferred, probes 
comprise oligonucleotides consisting of between 10 and 50 
nucleotides. In some preferred, probes comprise 

oligonucleotides consisting of between 12 and 20 nucleotides. 
The probes preferably contain nucleotide sequence completely 

35 identical or complementary to a fragment of a unique nucleotide 
sequence of human or murine Gabl . 
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PCR technology is practiced routinely by those having 
ordinary skill in the art and its uses in diagnostics are well 
known and accepted. Methods for practicing PCR technology are 
disclosed in "PCR Protocols: A Guide to Methods and 
Applications", Innis, M.A., et al. Eds. Academic Press, Inc. 
San Diego, CA (1990) which is incorporated herein by reference. 
Applications of PCR technology are disclosed in "Polymerase 
Chain Reaction" Erlich, H.A., et al . , Eds. Cold Spring Harbor 
Press, Cold Spring Harbor, NY (1989) which is incorporated 
herein by reference. Some simple rules aid in the design of 
efficient primers. Typical primers are 18-28 nucleotides in 
length having 50% to 60% g+c composition. The entire primer 
is preferably complementary to the sequence it must hybridize 
to. Preferably, primers generate PCR products 10 0 basepairs 
15 to 2000 base pairs. However, it is possible to generate 
products of 50 base pairs to up to 10 kb and more. 

PCR technology allows for the rapid generation of 
multiple copies of nucleotide sequences by providing 5' and 3' 
primers that hybridize to sequences present in a nucleic acid 
20 molecule, and further providing free nucleotides and an enzyme 
which fills in the complementary bases to the nucleotide 
sequence between the primers with the free nucleotides to 
produce a complementary strand of DNA. The enzyme will fill 
in the complementary sequences adjacent to the primers. If 
25 both the 5' primer and 3' primer hybridize to nucleotide 
sequences on the complementary strands of the same fragment of 
nucleic acid, exponential amplification of a specific double- 
stranded produce results. If only a single primer hybridizes 
to the nucleic acid molecule, linear amplification produces 
30 single-stranded products of variable length. 

One having ordinary skill in the art can isolate the 
nucleic acid molecule that encodes human or murine Gabl and 
insert it into an expression vector using standard techniques 
and readily available starting materials. 
35 The present invention relates to a recombinant 

expression vector that comprises a nucleotide sequence that 
encodes human Gabl that comprises the amino acid sequence of 
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SEQ ID N0:2. The present invention relates to a recombinant 
expression vector that comprises a nucleotide sequence that 
encodes human Gabl that comprises the amino acid sequence of 
SEQ ID NO:4. As used herein, the term "recombinant expression 
5 vector" is meant to refer to a plasmid, phage, viral particle 
or other vector which, when introduced into an appropriate 
host, contains the necessary genetic elements to direct 
expression of the coding sequence that encodes human or murine 
Gabl. The coding sequence is operably linked to the necessary 

10 regulatory sequences. Expression vectors are well known and 
readily available. Examples of expression vectors include 
plasmids, phages, viral vectors and other nucleic acid 
molecules or nucleic acid molecule containing vehicles useful 
to transform host cells and facilitate expression of coding 

15 sequences. In some embodiments, the recombinant expression 
vector comprises the nucleotide sequence that encodes human 
Gabl set forth in SEQ ID NO:l. In some embodiments, the 
recombinant expression vector comprises the nucleotide sequence 
that encodes murine Gabl set forth in SEQ ID NO : 3 . The 

20 recombinant expression vectors of the invention are useful for 
transforming hosts to prepare recombinant expression systems 
for preparing human or murine Gabl protein. 

The present invention relates to a host cell that 
comprises the recombinant expression vector that includes a 

25 nucleotide sequence that encodes human Gabl that comprises SEQ 
ID N0:2. In some embodiments, the host cell comprises a 
recombinant expression vector that comprises SEQ ID NO:l. The 
present invention relates to a host cell that comprises the 
recombinant expression vector that includes a nucleotide 

3 0 sequence that encodes murine Gabl that comprises SEQ ID NO:4. 
In some embodiments, the host cell comprises a recombinant 
expression vector that comprises SEQ ID NO: 3. 

Host cells for use in well known recombinant 
expression systems for production of proteins are well known 

35 and readily available. Examples of host cells include bacteria 
cells such as £. coli f yeast cells such as S. cerevisiae, 
insect cells such as S. frugiperda, non-human mammalian tissue 
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culture cells Chinese hamster ovary (CHO) cells and human 
tissue culture cells such as HeLa cells. 

The present invention relates to transgenic non-human 
mammals that comprises the recombinant expression vector that 
5 comprises a nucleic acid sequence that encodes human Gabl that 
comprises the amino acid sequence of SEQ ID NO: 2 and to 
transgenic non-human mammals that comprises the recombinant 
expression vector that . comprises a nucleic acid sequence that 
encodes murine Gabl that comprises the amino acid sequence of 

10 SEQ ID N0:4. Transgenic non-human mammals useful to produce 
recombinant proteins are well known as are the expression 
vectors necessary and the techniques for generating transgenic 
animals. Generally, the transgenic animal comprises a 
recombinant expression vector in which the nucleotide sequence 

15 that encodes human or murine Gabl is operably linked to a 
tissue specific promoter whereby the coding sequence is only 
expressed in that specific tissue. One application is that 
the tissue specific promoter is a mammary cell specific 
promoter and the recombinant protein so expressed is recovered 

20 from the animal's milk. In some embodiments, the coding 
sequence that encodes human Gabl is SEQ ID N0:1. In some 
embodiments, the coding sequence that encodes murine Gabl is 
SEQ ID NO: 3 . 

In some embodiments, for example, one having ordinary 
25 skill in the art can, using well known techniques, insert such 
DNA molecules into a commercially available expression vector 
for use in well known expression systems. For example, the 
commercially available plasmid pSE420 (Invitrogen, San Diego, 
CA) may be used for production of Gabl in £. coli. The 

3 0 commercially available plasmid pYES2 (Invitrogen, San Diego, 
CA) may, for example, be used for production in S. cerevisiae 
strains of yeast. The commercially available MAXBAC™ complete 
baculovirus expression system (Invitrogen, San Diego, CA) may, 
for example, be used for production in insect cells. The 

3 5 commercially available plasmid pcDNA I (Invitrogen, San Diego, 
CA) may, for example, be used for production in mammalian cells 
such as Chinese Hamster Ovary cells. One having ordinary skill 
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in the art can use these commercial expression vectors and 
systems or others to produce Gabl using routine techniques and 
readily available starting materials. (See e.g., Sambrook et 
al,, Molecular Cloning a Laboratory Manual, Second Ed. Cold 
5 Spring Harbor Press (1989) which is incorporated herein by 
reference.) Thus, the desired proteins can be prepared in both 
prokaryotic and eukaryotic systems, resulting in a spectrum of 
processed forms of the protein. 

One having ordinary skill in the art may use other 

10 commercially available expression vectors and systems or 
produce vectors using well known methods and readily available 
starting materials. Expression systems containing the 
requisite control sequences, such as promoters and 
polyadenylation signals, and preferably enhancers, are readily 

15 available and known in the art for a variety of hosts. See 
e.g., Sambrook et al., Molecular Cloning a Laboratory Manual , 
Second Ed. Cold Spring Harbor Press (1989) . 

A wide variety of eukaryotic hosts are also now 
available for production of recombinant foreign proteins. As 

20 in bacteria, eukaryotic hosts may be transformed with 
expression systems which produce the desired protein directly, 
but more commonly signal sequences are provided to effect the 
secretion of the protein. Eukaryotic systems have the 
additional advantage that they are able to process introns 

25 which may occur in the genomic sequences encoding proteins of 
higher organisms. Eukaryotic systems also provide a variety 
of processing mechanisms which result in, for example, 
glycosylation, carboxy- terminal amidation, oxidation or 
derivatization of certain amino acid residues, conformational 

3 0 control, and so forth. 

Commonly used eukaryotic systems include, but is not 
limited to, yeast, fungal cells, insect cells, mammalian cells, 
avian cells, and cells of higher plants. Suitable promoters 
are available which are compatible and operable for use in each 

35 of these host types as well as are termination sequences and 
enhancers, e.g. the baculovirus polyhedron promoter. As above, 
promoters can be either constitutive or inducible. For 
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example, in mammalian systems, the mouse metallothionein 
promoter can be induced by the addition of heavy metal ions. 

The particulars for the construction of expression 
systems suitable for desired hosts are known to those in the 
5 art. Briefly, for recombinant production of the protein, the 
DNA encoding the polypeptide is suitably ligated into the 
expression vector of choice. The DNA is operably linked to all 
regulatory elements which are necessary for expression of the 
DNA in the selected host. One having ordinary skill in the art 
10 can, using well known techniques, prepare expression vectors 
for recombinant production of the polypeptide. 

The expression vector including the DNA that encodes 
a Gabl protein is used to transform the compatible host which 
is then cultured and maintained under conditions wherein 
15 expression of the foreign DNA takes place. The protein of the 
present invention thus produced is recovered from the culture, 
either by lysing the cells or from the culture medium as 
appropriate and known to those in the* art . One having ordinary 
skill in the art can, using well known techniques, isolate Gabl 
20 protein that is produced using such expression systems. The 
methods of purifying a Gabl protein from natural sources using 
antibodies which specifically bind tc human or murine Gabl as 
described above, may be similarly applied to purifying human 
or murine Gabl produced by recombinant DNA methodology. 
25 Examples of genetic constructs include Gabl coding 

sequence operably linked to a promoter that is functional in 
the cell line into which the constructs are transfected. 
Examples of constitutive promoters include promoters from 
cytomegalovirus or SV40. Examples of inducible promoters 
3 0 include mouse mammary leukemia virus or metallothionein 
promoters. Those having ordinary skill in the art can readily 
produce genetic constructs useful for transfecting with cells 
with DNA that encodes a Gabl protein from readily available 
starting materials. Such gene constructs are useful for the 
35 production of the Gabl protein. 

In some embodiments of the invention, transgenic non- 
human animals are generated which express human or murine Gabl. 
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The transgenic animals that express human Gabl according to one 
embodiment of the invention contain SEQ ID NO:l under the 
regulatory control of a mammary specific promoter. The 
transgenic animals that express human Gabl according to one 
5 embodiment of the invention contain SEQ ID NO: 3 under the 
regulatory control of a tissue specific promoter. One having 
ordinary skill in the art using standard techniques, such as 
those taught in U.S. Patent No. 4,873,191 issued October 10, 
1989 to Wagner and U.S. Patent No. 4,736,866 issued April 12, 

10 1988 to Leder, both of which are incorporated herein by 
reference, can produce transgenic animals which express human 
or murine Gabl from the transgene. Preferred animals are goats 
and rodents, particularly rats and mice. 

In addition to producing Gabl by recombinant 

15 techniques, automated peptide synthesizers may also be employed 
to produce human or murine Gabl. Such techniques are well 
known to those having ordinary skill in the art and are useful 
if derivatives which have substitutions not provided for in 
DNA- encoded protein production. 

20 Nucleic acid molecules that encode human Gabl may be 

used as part of pharmaceutical compositions for gene therapy. 
Diseases characterized by underexpression of human Gabl may 
include diabetes. Those having ordinary skill in the art can 
readily identify individuals who are suspected of suffering 

25 from such diseases, conditions and disorders using standard 
diagnostic techniques. 

Nucleic acid molecules that encode Gabl may be 
delivered using any one of a variety of delivery components, 
such as recombinant viral expression vectors or other suitable 

30 delivery means, so as to affect their introduction and 
expression in compatible host cells. In general, viral vectors 
may be DNA viruses such as recombinant adenoviruses and 
recombinant vaccinia viruses or RNA viruses such as recombinant 
retroviruses. Other recombinant vectors include recombinant 

35 prokaryotes which can infect cells and express recombinant 
genes. In addition to recombinant vectors, other delivery 
components are also contemplated such as encapsulation in 
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liposomes, transf errin-mediated transfection and other 
receptor-mediated means. The invention is intended to include 
such other forms of expression vectors and other suitable 
delivery means which serve equivalent functions and which 
5 become known in the art subsequently hereto. 

In one embodiment of the present invention, DNA is 
delivered to competent host cells by means of an adenovirus. 
One skilled in the art would readily understand this technique 
of delivering DNA to a host cell by such means. Although the 
10 invention preferably includes adenovirus, the invention is 
intended to include any virus which serves equivalent 
functions . 

In another embodiment of the present invention, RNA 
is delivered to competent host cells by means of a retrovirus. 

15 One skilled in the art would readily understand this technique 
of delivering RNA to a host cell by such means. Any retrovirus 
which serves to express the protein encoded by the RNA is 
intended to be included in the present invention. 

In another embodiment of the present invention, 

20 nucleic acid is delivered through folate receptor means. The 
nucleic acid sequence to be delivered to a cell is linked to 
polylysine and the complex is delivered to cells by means of 
the folate receptor. U.S. Patent 5,108,921 issued April 28, 
1992 to Low et al . , which is incorporated herein by reference, 

25 describes such delivery components. 

Pharmaceutical compositions according to the invention 
include delivery components in combination with nucleic acid 
molecules that encode Gabl which further comprise a 
pharmaceutical ly acceptable carriers or vehicles, such as, for 

30 example, saline. Any medium may be used which allows for 
successful delivery of the nucleic acid. One skilled in the 
art would readily comprehend the multitude of pharmaceutical ly 
acceptable media that may be used in the present invention. 

The pharmaceutical compositions of the present 
35 invention may be administered by any means that enables the 
active agent to reach the agent's site of action in the body 
of a mammal. Pharmaceutical compositions may be administered 
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parenterally , i.e., intravenous, subcutaneous, intramuscular. 
Intravenous administration is the preferred route. 

Dosage varies depending upon known factors such as the 
pharmacodynamic characteristics of the particular agent, and 
5 its mode and route of administration; age, health, and weight 
of the recipient; nature and extent of symptoms, kind of 
concurrent treatment, frequency of treatment, and the effect 
desired. 

According to one aspect of the invention, compounds 
10 may be screened to identify human Gabl inhibitors, activators 
or substrates. Inhibitors of human Gabl are useful as anti- 
tumor agents. Activators of human Gabl are useful as diabetes 
treatment agents. Substrates of human Gabl are useful as 
reagents in assays for screening compounds with Gabl activity 
15 and investigating signal pathways. 

Inhibitors of human Gabl may be identified by 
screening compounds to ascertain their effect on Gabl activity. 
In some embodiments of the invention, compounds are screened 
to identify inhibitors by contacting human Gabl to Grb2 in the 
20 presence or absence of a test compound. Under assay 
conditions, the inhibitors will prevent or reduce binding of 
human Gabl to Grb2. Antibodies which inhibit Gabl/Grb2 binding 
are useful as inhibitors and, therefore as positive controls 
in the assay. 

25 Activators of human Gabl may be identified by 

screening compounds to ascertain their effect on Gabl/Grb2 
binding. In some embodiments of the invention, compounds are 
screened to identify activators by contacting human Gabl to 
Grb2 in the presence or absence of a test compound. Under 

30 assay conditions, the activators will enhance, accelerate or 
increase binding of human Gabl to Grb2 . Antibodies which 
inhibit Gabl/Grb2 binding are useful as negative controls in 
the assay. 

As used herein, the term substrate is meant to refer 
35 to proteins that bind to human Gabl. Cell proteins can be 
screened to identify those proteins that bind to human Gabl. 
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Kits are included which comprise containers with 
reagents necessary to screen test compounds. Such kits include 
human Gabl and Grb2, and instructions for performing the assay. 
Kits may include means to detect and/or measure human Gabl/Grb2 
5 binding such antibodies that bind to human Gabl/Grb2 complex 
but not uncomplexed proteins or antibodies that bind to 
uncomplexed proteins but not human Gabl/Grb2 complex. 
Optionally anti-human Gabl antibodies are provided as a 
control . 

X 0 Drugs that bind to Gabl, which can act as either 

inhibitors or activators of Gabl activity, can be obtained 
through the classic method of screening thousands of small 
organic compounds. An alternative source for potentially 
useful compounds would be random peptides derived from chemical 
15 synthesis or phage display techniques. 

Assays for useful compounds would involve the 
detection of molecules that bind with a high affinity to Gabl, 
where either the molecule or Gabl itself is labeled with an 
isotope or other such reporter molecule. The assay comprises 
2 0 incubating Gabl with the test compound and detecting the level 
of Gabl/test compound binding. Accordingly, the assay 
comprises Gabl, a test compound and a means to determine the 
level of binding. Typically, Gabl is immobilized to a solid 
phase such as being bound to a solid substrate (such as a bead 
25 or plate) and the test compound is labeled and free in 
solution. Incubation for a period of time (usually 30 min. to 
overnight) at a set temperature (22°-37°C) would occur followed 
by washing with buffer. The solid substrate would then be 
analyzed for the presence of the reporter molecule. Kits are 
30 provided which comprise Gabl, preferably bound to a solid 
substrate, and a means of distinguishing unbound Gabl from Gabl 
bound to a test compound. Kits may optionally have positive 
and/or negative controls. Optionally, such kits may also have 
instructions for performing such assays. 
35 Another embodiment provides an assay to identify 

molecules that interfere with the binding of other proteins to 
Gabl. Such proteins include, but are not limited to, the 
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binding of Grb2, PLC-y, PI-3-kinase, and SHPTP2/syp to Gabl . 
According to such embodiments, the second protein (e.g., Grb2, 
PLO7, PI-3-kinase, or SHPTP2/syp) and Gabl are contacted in 
the presence or absence of a test compound. A means is 
5 provided to distinguish Gabl bound to the second protein from 
unbound proteins. The level of binding in the presence and 
absence of the test compound is thereby determined and the 
capacity of the test compound to inhibit binding is so 
assessed. One means of distinguishing bound from unbound 
10 protein is the use of antibodies that specifically bind to 
unbound proteins (unbound Gabl or unbound second protein) but 
not the Gabl/seconds protein complex or antibodies that 
specifically bind to the Gabl/seconds protein complex but not 
unbound proteins (unbound Gabl or unbound second protein) . The 
15 use of detectable antibodies is well known. In some 
embodiments, one of the proteins is immobilized to a solid 
phase and the second protein is labeled and in solution. The 
labeled protein is contacted with the fixed protein in the 
presence or absence of the test compound and the level of 
20 labeled protein bound to the fixed protein is measured 
following a wash of the test compound and unlabeled protein. 
Some kits are provided which comprise a container with Gabl, 
a container with Grb2, PLC-y, PI-3-kinase or SHPTP2/syp, and 
antibodies which either bind to bound proteins but not unbound 
25 or antibodies which bind to either unbound Gabl or unbound 
Grb2, PLC-?, PI-3 -kinase or SHPTP2/syp but not bound proteins. 
Either Gabl or Grb2, PLC-7, PI-3-kinase, or SHPTP2/syp are 
fixed to a solid substrate. Some kits are provided which 
comprise a container with Gabl fixed to a solid phase and a 
30 container with labeled Grb2, PLC-y, PI-3-kinase or SHPTP2/syp. 
Some kits are provided which comprise a container with labeled 
Gabl and a container with Grb2, PLC-?, PI-3-kinase or 
SHPTP2/syp fixed to a solid phase. Kits may optionally have 
positive and/or negative controls. Optionally, such kits may 
35 also have instructions for performing such assays. 

In some embodiments of the invention, methods and kits 
are provided for identifying compounds that modulate PI -3 
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kinase protein binding to Gabl protein. When Gab-1 is 
phosphorylated, it binds to PI-3 kinase. Gab-1 is a substrate 
for epidermal growth factor receptor (EGFR) , insulin receptor, 
insulin growth factor l(IGF-l) receptor, platelet derived 
5 growth factor (PDGF) receptor, hepatocyte growth factor (HGF) 
receptor, TrkA receptor, IL-3 receptor, B cell receptor, or 
keratinocyte growth factor (KGF) receptor. Thus, by containing 
the receptor ligand to cells with the receptor such as for 
example by contacting cells that have EGFRs with EGF or cells 

10 with insulin receptors with insulin, Gabl will be 
phosphorylated and bind to PI-3 kinase. Gabl binding to PI-3 
kinase is detectable by many different methods including 
isolation of the Gabl complexes using anti-Gabl antibodies. 
Anti-PI-3 kinase antibodies can be used to detect PI-3 kinase 

15 bound to Gabl. Alternatively, PI-3 kinase activity in Gabl 
complexes indicates binding of Gabl to PI-3 kinase. According 
to the invention, cells that have a particular receptor are 
contacted with the receptor ligand in the presence (test assay) 
and absence (control assay) of test compounds. The amount of 

20 PI-3 kinase bound to Gabl in the test assay is compared to the 
amount of PI-3 bound to Gabl in the control assay. If PI- 
3/Gabl binding is less the test assay, the test compound is 
indicated to be an inhibitor of PI-3/Gabl binding. Kits 
according to the invention can include containers that comprise 

25 antibodies that bind to PI-3 and/or containers that comprise 
antibodies that bind to Gabl and/or containers comprising 
reagents for detecting PI-3 kinase activity and/or instructions 
for performing the assay. 

According to another embodiment, an assay is provided 

30 to identify compounds that inhibit the phosphorylation of Gabl 
by tyrosine kinases such as, for example but not limited to, 
the EGF and insulin receptors. In some such embodiments, Gabl 
is bound to solid substrate, the reaction buffer contains 32 P- 
7-ATP and tyrosine kinase is added in the presence or absence 
35 of a test compound. Test compounds are identified that result 
in a decrease in the amount of 32 P incorporated into Gabl 
compared to the level of phosphorylation observed in their 
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absence. Some kits are provided which comprise a container 
with Gabl fixed to a solid phase, a container with the reaction 
buffer contains 32 P-7-ATP and container with tyrosine kinase. 
Kits may optionally have positive and/or negative controls. 
5 Optionally, such kits may also have instructions for performing 
such assays. 

Another embodiment provides an assay to identify 
molecules that inhibit the phosphorylation of Gabl by 
serine/threonine kinases such as, but not limited to, MAP 

10 kinase, cdc2 kinase, protein kinase C, casein kinase, and cAMP 
kinase. The assay is identical to that above except that a 
serine /threonine kinase would be used as the source of enzyme. 
The kits are similar except a container with serine/threonine 
kinase is provided rather than tyrosine kinase. Kits may 

15 optionally have positive and/or negative controls. Optionally, 
such kits may also have instructions for performing such 
assays . " 

In some embodiments of the invention, methods and kits 
are provided for identifying compounds that modulate Gabl 

20 phosphorylation by epidermal growth factor receptor (EGFR) , 
insulin receptor, insulin growth factor l(IGF-l) receptor, 
platelet derived growth factor (PDGF) receptor, hepatocyte 
growth factor (HGF) receptor, TrkA receptor, IL-3 receptor, B 
cell receptor, or keratinocyte growth factor (KGF) receptor. 

25 Gab-1 is a substrate for epidermal growth factor receptor 
(EGFR), insulin receptor, insulin growth factor l(IGF-l) 
receptor, platelet derived growth factor (PDGF) receptor, 
hepatocyte growth factor (HGF) receptor, TrkA receptor, IL-3 
receptor, B cell receptor, or keratinocyte growth factor (KGF) 

3C receptor. Thus, using cells that have a particular receptor 
and contacting the cells with the receptor ligand, such as for 
example by contacting cells that have EGFRs with EGF or cells 
with insulin receptors with insulin, Gabl will be 
phosphorylated. Gabl phosphorylation is detectable by many 

35 different methods including electrophoresis. Phosphorylated 
Gabl runs as a different size as compared to unphosphorylated 
Gabl. According to the invention, cells that have a particular 
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receptor are contacted with the receptor ligand in the presence 
(test assay) and absence (control assay) of test compounds. 
The amount of phosphorylated Gabl generated in the test assay 
is compared to the amount of phosphorylated Gabl in the control 
5 assay. If the test assay has less phosphorylated Gabl, 
inhibition of Gabl phosphorylation is indicated, the test 
compound being an inhibitor. Similarly, if the test assay has 
more phosphorylated Gabl, enhancement of Gabl phosphorylation 
is indicated. 

10 Gabl has pleckstrin homology (PH) domains. 

Phosphatidylinositol, 4 , 5-bisphosphate (PIP 2 ) and the $y 
subunits of heterotrimeric G proteins are substrates for the 
PH domain. Additional embodiments of the invention provide 
assays to detect molecules that interfere with the binding of 

15 PIP 2 or jS-y subunits to Gabl. Either Gabl or the substrate is 
fixed to a solid phase and the other of Gabl or the substrate 
is labeled. The binding of Gabl to the substrate in the 
presence or absence of a test compound is measured. In some 
embodiments, the PIP 2 or Py subunits is labeled and free in 

20 solution, and the Gabl is fixed to a solid substrate. The test 
compound is free in solution. Kits are provided which comprise 
a container with Gabl fixed to a solid phase and a container 
with labeled substrate. Some kits are provided which comprise 
a container with labeled Gabl and a container with substrate 

25 fixed to a solid phase. Kits may optionally have positive 
and/or negative controls. Optionally, such kits may also have 
instructions for performing such assays. 

In the various assays of the invention, the preferred 
concentration of test compound is between 1/xM and 500/xM. A 

30 preferred concentration is 10/iM to lOO^M. In some preferred 
embodiments, it is desirable to use a series of dilutions of 

test compounds . 

It is further contemplated that murine Gabl may be 
used in place of human Gabl in^ assays and kits for screening 
35 compounds to identify human Gabl inhibitors, activators or 
substrates . 
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According to another aspect of the invention, 
transgenic animals, particularly transgenic mice, are 
generated. In some embodiments, the transgenic animals 
according to the invention contain a nucleic acid molecule 
5 which encodes human or murine Gabl . Such transgenic mice may 
be used as animal models for studying overexpression of Gabl 
and for use in drug evaluation and discovery efforts to find 
compounds effective to inhibit or modulate the activity of 
Gabl, such as for example compounds for treating neoplasms. 

10 One having ordinary skill in the art using standard techniques, 
such as those taught in U.S. Patent No. 4,873,191 issued 
October 10, 1989 Wagner and U.S. Patent No. 4,736,866 issued 
April 12, 1988 to Leder, both of which are incorporated herein 
by reference, can produce transgenic animals which produce the 

15 human or murine Gabl and use the animals in drug evaluation and 
discovery projects. 

Another aspect of the present invention relates to 
knock-out mice and methods of using the same. In particular, 
transgenic mice may be generated which are homozygous for a 

20 mutated, non- functional Gabl gene which is introduced into them 
using well known techniques. The mice produce no functional 
Gabl and are useful to study the function of Gabl. 
Furthermore, the mice may be used in assays to study the effect 
of test compounds in Gabl deficient animals. The Gabl 

25 deficient mice can be used to determine if, how and to what 
extent Gabl inhibitors will effect the animal and thereby 
address concerns associated with inhibiting the activity of the 
molecule . 

Methods of generating genetically deficient "knock 
30 out" mice are well known and disclosed in Capecchi, M. R. 
(1989) Science 244:1288-1292 and Li, P. et al . (1995) CELL 
80:401-411, which are each incorporated herein by reference. 
The human Gabl cDNA clone can be used to isolate a murine Gabl 
genomic clone. The genomic clone can be used to prepare a Gabl 
35 targeting construct which can disrupt the Gabl gene in the 
mouse by homologous recombination. 
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The targeting construct contains a non- functioning 
portion of the Gabl gene which inserts in place of the 
functioning portion of the native mouse gene. The non- 
functioning insert generally contains an insertion in the exon 
5 that encodes the active region of Gabl. The targeting 
construct can contain markers for both positive and negative 
selection. The positive selection marker allows for the 
selective elimination of cells without it while the negative 
selection marker allows for the elimination of cells that carry 
10 it. 

For example, a first selectable marker is a positive 
marker that will allow for the survival of cells carrying it. 
In some embodiments, the first selectable marker is an 
antibiotic resistance gene such as the neomycin resistance gene 
15 can be placed within the coding sequences of the Gabl gene to 
render it non- functional while additionally rendering the 
construct selectable. The antibiotic resistance gene is within 
the homologous region which can recombine with native 
sequences. Thus, upon homologous reconstruction, the non- 
20 functional and antibiotic resistance selectable gene sequences 
will be taken up. Knock out mice may be used as models for 
studying diabetes and screening compounds for treating 
diabetes . 

The targeting construct also contains a second 
25 selectable marker which is a negative selectable marker. Cells 
with the negative selectable marker will be eliminated. The 
second selectable marker is outside the recombination region. 
Thus, if the entire construct is present in the cell, both 
markers will be present. If the construct has recombined with 
30 native sequences, the first selectable marker will be 
incorporated into the genome and the second will be lost. The 
herpes simplex virus thymidine kinase (HSV tk) gene is an 
example of a negative selectable marker which can be used as 
a second marker to eliminate cells that carry it. Cells with 
35 the HSV tk gene are selectively killed in the presence of 
gangcyclovir . 
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Cells are transfected with targeting constructs and 
then selected for the presence of the first selection marker 
and the absence of the second. Clones are then injected into 
the blastocysts and implanted into pseudopregnant females. 
Chimeric offspring which are capable of transferring the 
recombinant genes in their germline are selected, mated and 
their offspring is examined for heterozygous carriers of the 
recombined genes. Mating of the heterozygous offspring can 
then be used to generate fully homozygous offspring which are 
the Gabl-def icient knock out mouse. 

The present invention relates to methods of and 
compositions for inhibiting the expression of Gabl in cells. 
In one embodiment, antisense oligonucleotides are provided 
which have a nucleotide sequence complementary to a nucleotide 
sequence of mRNA that encodes human Gabl. 

The antisense oligonucleotides of the present 
invention comprise sequences complementary tc regions of human 
Gabl mRNA. The oligonucleotides comprise a sequence 

complementary to a region selected from the sequence of human 
Gabl mRNA. The antisense oligonucleotides include single 
stranded DNA sequence and an antisense RNA oligonucleotide 
produced from an expression vector. Each of the antisense 
oligonucleotides of the present invention are complementary to 
regions of the human Gabl mRNA sequence. 

The antisense oligonucleotides of the present 
invention comprises a sequence complementary to a fragment of 
SEQ ID N0:1. See Ullrich et al., EMBO J. , 1986, 5:2503, which 
is incorporated herein by reference. Contemplated by this 
definition are fragments of oligos wichin the coding sequence 
for Gabl. Oligonucleotides are preferably complementary to a 
nucleotide sequence that is 5-50 nucleotides in length, in some 
embodiments 8-40, more preferably 12-25 nucleotides, in some 
embodiments 10-15 nucleotides and in some embodiments 12-20 
nucleotides . 

In addition, mismatches within the sequences 
identified above, which achieve the methods of the invention, 
such that the mismatched sequences are substantially 
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complementary to the Gabl sequences are also considered within 
the scope of the disclosure. Mismatches which permit 
substantial complement arily to the Gabl sequences will be known 
to those of skill in the art once armed with the present 
5 disclosure. The oligos may also be unmodified or modified by 
methods well known to those having ordinary skill in the art. 

The present invention is also directed to a method of 
inhibiting Gabl expression in mammals comprising administering 
to the mammal an effective . amount of an antisense 

10 oligonucleotide having a sequence which is complementary to a 
region of the Gabl mRNA. 

Methods of administering the antisense oligos of the 
present invention include techniques well known in the art such 
as and not limited to liposomes, plasmid expression, or viral 

15 vector incl&dfng retroviral vectors. In the administration of 
oligos via vectors or plasmids, a non- coding RNA strand of Gabl 
is preferably used in order to produce antisense RNA oligos 
which are expressed by the cell. The RNA oligos then bind Gabl 
sense or coding RNA sequence . 

20 Methods of administering the oligos to mammals include 

liposomes, and may be in a mixture with a pharmaceutically- 
acceptable carrier, selected with regard to the intended route 
of administration and the standard pharmaceutical practice. 
In addition, antibodies, ligands and the like may be 

25 incorporated into the liposomes thereby providing various modes 
of inhibiting Gabl expression. Dosages will be set with regard 
to weight, and clinical condition of the patient. The 
proportional ratio of active ingredient to carrier will 
naturally depend on the chemical nature, solubility, and 

3 0 stability of the compounds, as well as the dosage contemplated. 
The oligos of the present invention will be administered for 
a time - sufficient for the mammals to be free of 
undifferentiated cells and/or cells having an abnormal 
phenctype . 

35 The oligos of the invention may be employed in the 

method of the invention singly or in combination with other 
compounds. The amount to be administered will also depend on 
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such factors as the age, weight, and clinical condition of the 
patient. See Gennaro, Alfonso, ed. # Remington's Pharmaceutical 
Sciences, 18th Edition, 1990, Mack Publishing Co., Easton PA. 

The compounds of the present invention may be 
5 administered by any suitable route, including inoculation and 
injection, for example, intravenous, oral, intraperitoneal, 
intramuscular, subcutaneous, topically, and by absorption 
through epithelial or mucocutaneous linings, for example, 
nasal, oral, vaginal, rectal and gastrointestinal. 

10 The mode of administration of the oligos may determine 

the sites in the organism to which the compound will be 
delivered. For instance, topical application may be 

administered in creams, ointments, gels, oils, emulsions, 
pastes, lotions, and the like. The oligos of the present 

15 invention may be administered alone or will generally be 
administered in admixture with a pharmaceutical carrier 
selected with regard to the intended route of administration 
and standard pharmaceutical practice. For parenteral 
administration, they are best used in the form of sterile 

2 0 aqueous solution which may contain other solutes, for example, 

sufficient salts, glucose or dextrose to make the solution 
isotonic. For oral mode of administration, the present 
invention may be used in the form of tablets, capsules, 
lozenges, troches, powders, syrups, elixirs, aqueous solutions 
25 and suspension, and the like. Various disintegrants such as 
starch, and lubricating agents may be used. For oral 
administration in capsule form, useful diluents are lactose and 
high molecular weight polyethylene glycols. When aqueous 
suspensions are required for oral use, certain sweetening 

3 0 and/or flavoring agents may be added. Forty iig/ml antisense 

oligo was used for in vitro methods of providing oligos in 

media for cell growth in culture. This concentration may be 

extrapolated for in vivo use. The concentration of antisense 

oligonucleotides for in vivo use is about 40/ig/kg body weight. 

35 The in vivo use of the expression vector expressing RNA 

oligonucleotides is determined by the number of transfected 
cells. 
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For in vivo use, the antisense oligonucleotide may be 
combined with a pharmaceutically acceptable carrier, such as 
suitable liquid vehicle or excipient and an optional auxiliary 
additive or additives. The liquid vehicles and excipients are 
5 conventional and commercially available. Illustrative thereof 
are distilled water, physiological saline, aqueous solution of 
dextrose, and the like. For invivo antineoplastic use, the 
antisense oligonucleotides may be administered intravenously. 

In addition to administration with conventional 

10 carriers, antisense oligonucleotides may be administered by a 
variety of specialized oligonucleotide delivery techniques. 
For example, oligonucleotides have been successfully 
encapsulated in unilamellar liposomes. Reconstituted Sendai 
virus envelopes have been successfully used to deliver RNA and 

15 DNA to cells. Arad et al . , Biochem. Biophy. Acta., 1986, 859, 
88-94. 

Since the phosphorylation scatus of Gabl is essential 
to communicating with other effectors, the degree of 
phosphorylation may be used in a diagnostic/prognostic assay. 

20 One method to assay the degree of tyrosine phosphorylation on 
Gabl is to immunoprecipitate Gabl from tumor lysates and to 
measure the binding of an ant i-phospho tyrosine antibody to 
Gabl. Antibodies which selectively bind to phosphorylated Gabl 
but not unphosphorylated Gabl or which bind to unphosphorylated 

25 Gabl but not phosphorylated Gabl may be used to assess the 
amount of each form of Gabl in a sample. Since the tyrosine 
phosphorylation of- Gabl causes a change in mobility of Gabl on 
SDS-PAGE, a second method is to perform Western blots on tumor 
lysates using an ant i -Gabl antibody. This reveals the change 

30 in mobility of Gabl in tumors where presumably tumors would 
show a shift towards a higher apparent molecular weight (slower 
mobility on SDS-PAGE) . Kits may be provided which include 
anti-Gabl antibodies including specific anti-phosphorylated 
Gabl antibodies and/or ant i -unphosphorylated Gabl antibodies. 

35 and instructions for performing the method. Positive and/or 
negative controls may be optionally provided. 



BNSDOCID: <WO 9707B27A1J_> 



WO 97/07827 PCI7US96/13842 

- 32 - 

Gabl is also phosphorylated by the insulin receptor, 
and since it binds PI -3 -kinase and SHPTP2/syp, it is also 
likely to participate in the physiologic response to insulin. 
Hence, Gabl may be involved in diabetes if there is a 
5 deficiency of Gabl in diabetics. This could either be through 
a failure to express this protein or a mutation in the sequence 
leading to decreased phosphorylation. This decrease in 
tyrosine phosphorylation could be assayed in a manner similar 
to that described above. Kits may be provided which include 
10 anti-Gabl antibodies and instructions for performing the 
method. Positive and/or negative controls may be optionally 
provided. 

EXAMPLES 
Example 1 

15 A cDNA called Gabl (Grb2 associated binder- 1) was 

identified using recombinant Grb2 to screen an express cDNA 
library from a human glial tumor. The cDNA library was made 
as follows. Double selected poly A*RNA was isolated from tumor 
D256. First strand synthesis was performed using random 

20 hexamer priming and converted to double stranded cDNA. The 
products were electrophoresed on acrylamide gels and cDNA above 
1 kb was selected for cloning. The vector used for expression 
was XEXlox (Novagen) . Grb2 used to probe the library was 
expressed in bacteria as a fusion protein with glutathione S- 

25 transferase and the phosphorylation site for protein kinase A 
{Ron, D. and Dressier, H. (1992) Biotechniques 13:866-869). 
Briefly, the cDNA for human Grb2 was cloned in- frame into the 
pGEX vector (Pharmacia) which was engineered to also carry the 
site for phosphorylation by protein kinase A. The resulting 

30 GST-Grb2 fusion protein was labeled to a specific activity of 
1 X 10 7 cpm//zg of protein and 5 x 10 s cpm/ml was used to screen 
the library. Purified protein was labeled with 32 P by an in 
vitro kinase reaction and then used to screen an expression 
cDNA library constructed from a glial tumor, 256. Resulting 

35 positives were plaque purified twice and then converted to 
plasmid using the cre-lox system as described by the 
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manufacturer. Six positive clones were identified from -2.5 
x 10 5 plaques screened, 5 of which were unique as determined by 
DNA sequencing. One clone, which contained a 1.6 kb insert 
that did not correspond to any previously described sequence, 
5 was used to rescreen oligo dT and hexamer primed libraries from 
this tumor which resulted in the isolation of 4.2 kb total of 
the transcript and contained a polyadenylation signal followed 
by a poly A tract. 

Sequencing reactions were performed using di-deoxy 
10 terminators coupled to laser f luorochromes (Prism kit, Applied 
Biosystems) and the reactions run on an ABI 373 A automated 
sequencer. Primers were used for sequencing on both strands 
at -200-400 bp intervals. Sequence analysis revealed the 
presence of an initiator methionine followed by a 2 . 1 kb open 
15 reading frame. The complete amino acid sequence for Gabl is 
presented in SEQ ID NO: 2 and shown in Figure 1. Gabl encodes 
a 694 amino acid protein that has a predicted size of 77 kDa . 
Two putative binding sites for the SH3 domains of Grb2 were 
identified that were similar to those found in dynamin and SOS 
20 (Yu, H., et al (1994) Cell 76:933-945). An amino acid homology 
search revealed that Gabl was most similar to the human IRS-1, 
a 130 kDa protein that is one of the major tyrosine 
phosphorylated proteins following insulin receptor stimulation 
(Sun, X.J., et al. (1991) Mature 352:73-44). The highest 
25 degree of homology (31% identity and 44% similarity) was in the 
pleckstrin homology (PH) domain that is found in the N-terminus 
of both proteins (Musacchio, A. et al . (1993) Trends. Biochem. 
Sci. 18:343-344) (Figure 2). PH domains do not show a high 
degree of overall homology, but of the known PH domains the 
30 Gabl and IRS-1 PH domains are the most closely related to each 
other. The similarities between the two proteins extended to 
other structural features. The distal two-thirds of both 
proteins are extremely proline and serine rich resulting in 
numerous potential phosphorylation sites for serine/threonine 
3 5 kinases including cdc2 kinases, protein kinase C, casein kinase 
and MAP kinase (Figure 3). There are 5 predicted sites, for 
cdc2 kinase in Gabl but only one site in IRS-1, while there are 
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not sites for cAMP dependent protein kinase in Gabl but 6 in 
IRS-1. Overall, 47 predicted serine/threonine phosphorylation 
sites can be found in Gabl and 51 sites such in IRS-1. A key 
feature of IRS-1 is the presence of 20 potential tyrosine 
5 phosphorylation sites. These sites can recruit proteins with 
SH2 domains enabling IRS-1 to act as a docking protein in 
signal transduction. It has been shown that Grb2 (Skolnik, 
E.Y., et al. (1993) Science 260:1953 

-1955), PI-3-kinase, Nek (Lee, C.H., et al. (1993) Proc. Natl. 

10 Acad. U.S.A. 90:11713-11717), and SHPTP2/syp (Sun, X.J., et al. 
(1993) Mol. Cell Biol. 13:7418-7428) directly interact with 
IRS-1. Gabl contains 16 potential binding sites for the SH2 
domains of PI-3-kinase, PLC-7, Nek, and SHPTP2/syp which were 
identified in Gabl using the consensus binding motifs for these 

15 domains (Songyang, 2., etal. (1994) Mol. Cell. Biol. 14:2777- 
2785 and Songyang, Z., et al . (1993) Cell 72:767-778) . IRS-1 
does contain a phosphotyrosine binding domain (Gustaf son, T.A. , 
et al. (1995) Mol. Cell Bio. 15:2500-2508) that is not found 
in Gabl, but the attachment to Grb2 provides an alternate means 

20 for Gabl to associate with receptors. Overall, the similar 
number of tyrosine and serine/threonine phosphorylation sites 
in the much smaller Gabl suggest that it is a compressed 
version of IRS-1. 

Northern blots were prepared using total RNA from 

25 tumor 256 or human brain and hybridized with the Gabl cDNA 
probe. Briefly, 16 fig of total RNA was electrophoresed in 1% 
HEPES/f ormaldehyde gels and transferred to nylon membranes. 
Standard hybridization and washing conditions were used. The 
Gabl cDNA probe detected two transcripts of 4.2 and 7.0 kb in 

30 RNA isolated from tumor 256 and human brain. The complete open 
reading frame of Gabl could be accounted for in the 4.2 
transcript . 

Zoo blots were also performed under high stringency 
conditions to evaluate the species conservation of Gabl. 
3 5 Genomic DNA from human, cow, cat, dog, horse, mouse and pig was 
restricted with EcoRI and used to prepare Southern blots. 
Briefly, 10 of genomic DNA was digested with EcoRI, 
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electrophoresed in 1% agarose gels and transferred to nylon 
membranes. Blots were hybridized with the Gabl cDNA. Normal 
stringency hybridization and washing conditions were used. 
Results demonstrated a high degree of conservation across all 
5 species tested, including rodents. 

Gabl was observed to interact with Grb2 . BL21 
bacteria were transformed with an expression plasmid that 
generated a fusion protein between the gene 10 protein and the 
proline/serine rich region of Gabl. 

10 GST fusion protein with either Grb2 or Nek was labeled 

with 32 P and incubated with Western blots containing lysates 
from BL21 bacteria that had been transformed with XExlox 
plasmid encoding for a fragment of Gabl or no insert. Since 
the fusion protein generated by XExlox vector is from gene 10, 

15 there is no potential of dimerization with the GST portion of 
the probes. 

Far Western blots performed with Grb2 as the probe 
identified an -100 kDa protein in cells transformed with the 
Gabl fragment but not in cells transformed with plasmid 
20 containing no insert. Gabl is specifically recognized by Grb2 . 

The apparent size of the fusion protein was - 20 kDa 
greater than expected and was probably due to the high proline 
content of this segment. IRS-l has also been noted to migrate 
as a larger protein in SDS-PAGE. 
25 in contrast, Nek, which contains three SH3 domains, 

did not bind the Gabl fragment indicating that this recognition 
was not a property of SHS3 domains in general. Preliminary 
results show that both SH3 domains of Grb2 bind to Gabl but the 
C- terminal SH3 domain has a higher affinity. - 
30 To confirm that Gabl can interact with native Grb2, 

a GST fusion protein was generated with this same fragment from 
Gabl (GST-Gabl) . This * was used in GST precipitation 
experiments with lysates from A431 cells. A fragment 
containing amino acids 203-689 of Gabl was cloned into the pGEX 
35 vector and used to produce GST-Gabl fusion protein for the 
precipitations. A431 cells were serum starved overnight and 
than left untreated or stimulated with EGF (100 ng/ml) (human, 
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Gibco/BRL) for 10 min. Cells were lysed in a buffer containing 
PBS, 1% Triton, 0.5% deoxycholate, 0.1% SDS, 0.004% NaF, 100 
jig/ml PMSF, 1 /ig/ml aprotinin, 1 fig /ml leupeptin, and 2 mM Na 
orthovanadate and cleared of insoluble material by 
5 centrifugation. 3 fxg of GST-Gabl or GST was incubated with 150 
jig of A431 lysate for 3 0 min. at 4°C, and then incubated with 
20 ill of glutathione beads (Pharmacia) . The pellet was washed 
three times with lysis buffer and resuspended in SDS - PAGE 
sample buffer. One- third of the resulting pellet and 

10 supernatant was electrophoresed on SDS-PAGE. The resulting 
Western blot was incubated with 1 ^g/ m l of a monoclonal 
antibody against Grb2 (Upstate Biotechnology, Inc.), washed and 
then incubated with an 125 I anti -mouse secondary antibody. 
Western blot analysis of the precipitates using antibodies 

15 against Grb2 showed that Grb2 was specifically bound by GST- 
Gabl but not by GST protein. This binding did not require the 
presence of EGF, which was expected since SH3 domain 
interactions are independent of growth factor addition. 

Gabl was determined to be a substrate for tyrosine 

2 0 kinases. The EGF receptor was observed to phosphorylate Gabl. 
A431 cells were serum starved overnight and then left untreated 
or stimulated with EGF for 10 min. Lysates were prepared and 
EGF receptor was then immuncprecipitated and subsequently used 
in a kinase reaction with either GST-Gabl or GST in the 

25 presence of 32 P-ATP. A portion of the reaction was then 
electrophoresed on SDS-PAGE. A band was visible in the GST 
lane but found to represent minor phosphorylation of the IgG 
heavy chain. EGF receptor was immunoprecipitated from A431 
cells that were treated without or with EGF. This revealed 

30 that GST-Gabl, but not GST, was phosphorylated by the EGF 
receptor which was dependent upon the EGF addition. 

Glial tumors frequently express high levels of a 
naturally occurring deletion within the EGF receptor, called 
EGFRvlll. This receptor has been shown to be a constitutively 

35 active kinase (Ekstrand, A.J., et al . (1994) Oncogene 9:2313- 
2320 and Wong, A.J., et al. (1994) Sem. One. 21:139-148). The 
tumor from which Gabl was isolated predominantly expresses 
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EGFRvIII. When tested to determine whether this receptor could 
also phosphorylate Gabl, Gabl was observed to be a substrate 
for the mutant EGF receptor. In vitro kinase assays were 
performed using EGFRvIII immunoprecipitated from HC2 20 d2 
5 cells, an NIH-3T3 cell line transfected with the human cDNA for 
this mutant receptor. The HC2 20 d2 cells were transfected 
with a cDNA for the EGFRvIII and serum starved overnight. 
Because the EGFRvIII is constitutively active no stimulation 
of this receptor was necessary in order to obtain maximal 

10 activity. A431 or HC2 20 d2 cells were serum starved for 18h 
and stimulated with EGF (100 ng/ml) for 10 min. Cells were 
lysed and lysates were then subject to immunoprecipitation with 
an antibody that specifically recognizes this mutant but not 
the wild type EGF receptor. 10 ^g of ant i- human EGF receptor 

15 antibody (Gibco/BRL) or anti-EGFRvIII was first prebound to 20 
/il of protein G/A agarose (Oncogene Science) . 1 mg of cell 
lysate was incubated with the pellet for 2-3h at 4°C. The 
pellet was washed 2 times with the same buffer and then washed 
once with HNTG buffer (150 mM NaCl, 18 mM MgCl 2 , 10% glycerol, 

20 0.1% Triton and 20 mN Hepes (pH 7.5)). Kinase reactions were 
then performed in the presence of 32 P-ATP using this receptor 
and either GST-Gabl or GST using 1 \iq of fusion protein in HNTG 
buffer with 2.7 /zM ATP, 6 mM MnCl 2 , and 10 jiCi of [y- 32 P] ATP 
(6000 Ci/mmol) for 15 min. at 4°C. The reaction was stopped 

25 by the addition of SDS-PAGE sample buffer. One- third of the 
reaction was electrophoresed on SDS-PAGE and visualized by 
autoradiography. GST-Gabl protein, but not GST, was 
specifically phosphorylated by this receptor. 

Because of the similarity with IRS-1, Gabl was 

30 evaluated as a substrate of the insulin receptor. Insulin 
receptor phosphorylation of Gabl was observed. In vitro kinase 
assays were done using purified insulin receptor. Purified 
insulin receptor from rat liver was first subjected to an 
autophosphorylation reaction to avoid interference from the 0 
35 subunit of the insulin receptor (IR-0) . Briefly, i mg/ml 
receptor was incubated in a buffer containing 50 mM Hepes (pH 
7.5), 5 mM MnCl 2 , 2.5 mM ATP, and 100 nM insulin (porcine, 
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Gibco/BRL) at 3 0°C for 30 min. This receptor preparation was 
then used in kinase assays containing either GST or GST-Gabl 
in the presence of 32 P-ATP. Kinase assays were performed using 
100 ng of receptor and 1 \ig of fusion protein in a final volume 
5 of 50 /il using a buffer containing 50 mM Hepes (pH 7.5), 5 mM 
MnCl 2 , 50 jxM ATP and 0.3 mCi of [y- 22 P] ATP (6000 Ci/mmol) for 
30 min. at 25°C. Using these conditions little j3 subunit 
phosphorylation was detected. Results demonstrated that GST- 
Gabl was also specifically and highly phosphorylated by this 
10 kinase. 

To determine if Gabl is tyrosine phosphorylated in 
vivo, Gabl was immunoprecipitated from A431 lysates using an 
affinity purified Gabl antibody and the Western blots were 
incubated with the same antibody. A431 cells were serum 

15 starved overnight and then left untreated or stimulated with 
EGF (100 ng/ml) for 10 min. or insulin (100 nM) for 5 min. 
Cells were then lysed and used directly for SDS-PAGE or then 
immunoprecipitated with antiGabl antibody. 50 fxg of cell 
lysate was run directly on the gel or 1 mg of protein was used 

20 for immunoprecipitation with 10 pg/ml of anti-Gabl antibody. 
Western blots were incubated with 1 /xg/ml of anti-Gabl antibody 
or 2 fig/ml of anti-phosphotyrosine antibody (4G10, U.B.I.) 
followed by 125 I labeled secondary antibody against either 
rabbit antibody (anti-Gabl) or mouse antibody (4G10) for 

25 detection. The anti-Gabl antibody was raised in rabbits by 
immunizing with the GST-Gabl protein. The resulting antisera 
was affinity purified using GST-Gabl coupled to agarose 
(AminoLink, Pierce) and any contaminating antibodies directed 
against GST were removed by passage over a GST affinity column. 

3 0 Results demonstrated that Gabl is tyrosine phosphorylated in 
response to EGF or insulin stimulation. Endogenous Gabl 
migrated as an -115 kDa protein in unstimulated cells. This 
slower than predicted mobility may be due to a combination of 
its proline rich nature and serine/threonine phosphorylation 

35 in vivo. The addition of EGF resulted in a further decrease 
in mobility to an apparent size of -120 kDa. When similar 
blots were incubated with an anti-phosphotyrosine antibody only 
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the band in EGF stimulated cells was detected. Similar 
experiments were performed using insulin to stimulate A431 
cells. Consistent with the in vitro assays, Gabl was tyrosine 
phosphorylated following the addition of insulin. The degree 
5 of phosphorylation on Gabl appeared similar regardless of which 
growth factor was used. 

Having established that Gabl is tyrosine 
phosphorylated in vivo, Gabl was investigated to determine if 
it could act as a docking protein for SH2 -proteins . 
10 Western blots containing anti-Gabl 

immunoprecipi tat ions from unstimulated or EGF stimulated A431 
cells were incubated with antibodies to Nek, PI -3 -kinase, PLC- 
y , SHPTP2/syp as well as Grb2 . A431 cells were prepared as 
described above and then 1 mg of protein was used for 
15 immunoprecipitations with anti-Gabl antibody. The blots were 
divided and the appropriate portion incubated with antibodies 
against Grb2, Nek, PLC-y, PI -3 -kinase, or SHPTP2/syp. Primary 
antibodies were used at a concentration of 1 jig/ml. The 
antibodies used were: anti-Grb2 (U.B.I. ); anti-PI-3-kinase 
20 (Transduction Laboratories) ; anti-PLC-7 (U.B.I.); anti-SHPTP2 
(Transduction Laboratories); and anti-Nck (Transduction 
Laboratories) . 125 I ant i -mouse antibody was used for detection 
for all these antibodies. PI-3-kinase, PLC-7, and SHPTP2/syp 
were all readily detected in immunoprecipitations from EGF 
25 treated cells. A band was visible in the immunoprecipitation 
lanes with the anti-Nck antibody. It was determined to be due 
to a cross-reaction from the heavy chain of the anti-Gabl 
antibody. As expected, Grb2 showed an association with Gabl 
in the absence of growth factor, but this association was 
30 strongly enhanced by the addition of EGF suggesting further 
interaction via the SH2 domain. Grb2 has also been noted to 
associate with Vav or C-abl by both SH2 and SH3 domain binding. 

To confirm this observation, Far Western blots were 
performed which verified that the SH2 domain of Grb2 die 
35 recognize Gabl immunoprecipitated from EGF treated cells. 
Several other proteins with SH2 domains, specifically, the SH2 
domains from PLC-y and PI -3 -kinase , bur not full length Nek, 
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exhibited strong binding to Gabl immunoprecipitated from EGF 
stimulated cells. Portions of the blot containing Gabl were 
incubated with GST fusion proteins containing the SH2 domains 
from Grb2 (Grb2-SH2), both SH2 domains of PLC-7 (PLC-SH2 ) , the 
5 N-terminal SH2 domain of PI-3-kinase (PI-3-K (N) SH2 ) or the C- 
terminal domain (PI-3-K(C) SH2) , or total Nek. Far Western 
blots were incubated with 5 tig/ml of the GST fusion protein, 
followed by ant i -GST antibody at 1 fig/ml (Santa Cruz 
Biotechnology) , and then 125 I anti-mouse antibody for detection. 

10 The portions of the amino acid sequence that these fusion 
proteins encompassed were: Grb2-SH2, a. a. 50-161 of human 
Grb2; PI-3-K(N)SH2, a. a. 321-440, and PI-3-K(C) SH2, a. a. 614- 
724 of human PI-3-kinase; and PLC-SH2 , a. a. 540-797 of human 
PLC-7 X . Identical patterns of SH2-protein association were 

15 obtained when the A431 cells were stimulated with insulin. 

Tyrosine phosphorylation of Gabl mediates interaction 
with several proteins that contain SH2 domains . The range of 
proteins that associate with Gabl is similar to that of IRS-1 
except that IRS-1 binds Nek but fails to bind PLC-y (White, 

20 M.F. (1994) Curr. Opin. Genet, and Dev. 4:47-54 and Myers, M.G. 
and White, M.F. (1993) Diabetes 42 : 643-650) . 

Mult i -site docking proteins could play an important 
role in integrating signals from proteins that contain SH2 
domains. Previously, it had not been clear if docking proteins 

25 were necessary for only a few RPTKs or were a more general 
feature of RPTK mediated signaling. Typically RPTKs possess 
multiple autophosphorylation sites that can recruit SH2- 
proeeins. The insulin and IGF-1 receptors are unusual in that 
autophosphorylation does not result in binding of SH2-proteins 

3 0 directly to the receptor so the presence of IRS-1 is critical 
for signal transmission. However, IRS-1 is specific for these 
RPTKs since it is not phosphorylated by the EGF or PDGF 
receptors and IL-4 is the only other growth factor known to 
cause phosphorylation (Wang, L.-M., et al . (1993) Proc. Natl. 

35 Acad. Sci. U.S.A. 90:4032-4036). Two related molecules, 4 PS 
and IRs-2 (Tamemoto, H., et al. (1994) Mature 372:182-186, 
Araki, e., e t al. (1994) Nature 372:186-190 and Tobe, K., et 
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al. (1995) J". Biol. Chem. 270:5698-5701), may have a similar 
specificity. The only other potential docking protein 
described is pl30 cas but it primarily has sites for the SH2 
domain of c-crk. (Sakai, R. , et al. (1994) EMBO J. 13:3748- 
5 3756) . The identification of Gabl shows that a docking protein 
is found in the EGF receptor signaling pathway. Other such 
proteins may be found for other tyrosine kinase receptors. 

Unlike IRS-1, Gabl is the target of the SH3 domains 
of Grb2. Since the SH2 domain of Grb2 can bind to a wide 
10 variety of receptor tyrosine kinases, Gabl may be positioned 
downstream of multiple receptors. The SH2 domain of Grb2 can 
also interact with focal adhesion kinase (FAK) and this 
mediates signaling by integrins with the ras pathway 
(Schlaepfer, D.D. et al. (1994) Nature 372 : 786-791) . Docking 
15 proteins may also have a role in integrin signaling as insulin 
can promote the associate of IRS-1 with of v /3 3 , although this 
interaction appears to be specific for this integrin (Vuori, 
K. and Ruoslahti, E. (1994) Science 266:1576-1578). A FAK- 
Grb2-Gabl complex may be a more universal means for involving 
20 docking proteins in integrin signaling. 

PH domain and multiple sites for serine/threonine 
kinases are present in Gabl. Gabl may integrate pathways not 
directly related to tyrosine kinase signaling. The PH domains 
from ^-adrenergic receptor kinase, rasGAP, PLC-y and BTK have 
25 been shown to bind the py subunits of G-proteins and this has 
been localized to the carboxy terminal half of the PH domain 
for the latter three proteins (Touhara, K. et al. (1994) J. 
Biol. Chein. 269:10217-10220). Gabl may regulate PH domain 
proteins linking G-proteins signaling with the ras/MAP kinase 
30 pathway. The amino terminal half of several PH domains can 
bind to phosphatidylinositol-4,5-bisphosphate (Crespo, P.J., 
et al. (1994) Nature 369:418-420). Gabl may be involved in co- 
ordinating or otherwise controlling the interaction of PLC-T 
with its substrate. Phosphorylation by serine/threonine 
35 kinases may provide a means for modulating SH2 or PH domain 
based interactions with Gabl. 
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The fact that Gabl is a substrate of the EGF receptor 
indicates a role in mediating cell growth and neoplasia. Gabl 
is overexpressed in several glial tumors as compared to normal 
brain. Since it is also phosphorylated by the insulin receptor 
5 it may be involved in the cellular response to insulin. Mice 
in which the IRS-1 gene has been disrupted by homologous 
recombination show growth retardation but few other 
abnormalities which may be due to compensatory expression of 
another docking protein such as Gabl. 
10 Example 2 

PI -3 -kinase is an enzyme that plays an important role 
in cell growth and glucose metabolism. While PI-3-kinase is 
activated following the addition of EGF or insulin to cells, 
PI -3 -kinase does not bind to the EGF or insulin receptor 

15 directly. Instead, these receptors phosphorylate other 
substrates to which PI-3-kinase binds to. It has been shown 
that in certain cells IRS-1 is the major binding protein for 
PI-3 -kinase following insulin addition. 

Experiments were performed to evaluate whether Gabl 

20 serves a similar function in EGF signaling. PI-3 K was assayed 
using 10 jig anti-EGF receptor. 5/xg anti-Gabl and 15^ig anti- 
phosphotyrosine antibody (PY20: Transduction Laboratories) were 
unsed for itnmunoprecipitations. Assays were run at least 5 
times in duplicate. For transfections, full-length Gabl cDNA 

25 was cloned into pLTR2 and this construct or vector only were 
co-transf ected with pKOneo plasmid into NIH3T3 cells. G418- 
resistant clones (500/xg/ml' 1 ) were subcloned twice by limiting 
dilution. Cell growth was assayed by seeding 50 , 000 cells per 
3 5 -mm well in DMEM containing 1% calf serum, G418 and the 

30 growth factor. For growth in soft agarose, 2,000 cells were 
suspended in 1ml of medium containing 0.3% agarose (low-melting 
Sigma) , 10% calf serum, G418 and growth factor, and seeded over 
a 2 -ml 0.6% agarose layer in 35-mm dishes. Cells were fed 
weekly with 1ml suspension medium. After three weeks the 

35 number of colonies larger than 60/im was counted. For the MAP 
kinase mobility shift assay, cell lysates were electrophoresed 
in 8.5% gels and the subsequent western blots were incubated 
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with an anti-MAP kinase antibody (anti-pan ERK, Transduction 
Laboratories) which consistently detected ERK2 in these cells. 
Anti-Sos immunoprecipi tat ions were done using 5/ig of a mouse 
monoclonal anti-body (Transduction Labs) . The results are 
5 shown in Figures 4A-4J. 

Figures 4A and 4B show results from experiments in 
which PI -3 Kinase was assayed on immunoprecipitates (IP) 
produced by the indicated antibodies (Ab) on lysates from 
untreated, EGF- stimulated (A431) or insulin- stimulated (3T3-L1) 
10 cells, PIP, phosphatidyl -inositol -3 -phosphate. 

Figures 4C and 4D show Quantification of the PI(3)K assay. 
Bars represent average 32 P c.p.m. values (Cerenkov counts) with 
s.d. Numbers above the bars designate average c.p.m. 

Figures 4E ( 4F, and 4G show results of Cell -growth 
15 assays on the Pilar9 (squares) and the Pilarl2 (triangles) cell 
lines (which express Gabl at 13- and 8- fold over control 
(diamonds) cells, respectively) or a vector-only transfected 
cell line (control) in 1% serum without additional growth 
factor (-GF) or in the presence of EGF or insulin. 
20 In Figure 4H, cell lines were seeded in soft agarose 

and grown in 10% serum without additional growth factor (-GF) 
or in the presence of EGF or insulin. 

In Figure 41, cell lines were serum-starved and then 
treated with EGF or insulin for the times indicated. Western 
25 blots were incubated with an antibody against MAP kinase (p42) . 
Phosphorylated ERK2, which corresponds to the active form, is 
seen as a slower migrating band (pp-42) . 

In Figure 4J, lysates from the cell lines were 
immunoprecipitated with antibodies against Sos or Gabl. The 
30 subsequent western blots were incubated with antibodies against 
Sos . Gabl or Grb2 . 

The results in Figures 4A-4D confirm that EGF 
stimulation did not produce a substantial increase in PI-3- 
kinase activity in anti-EGF receptor immunoprecipitates in A431 
35 cells. In addition, these data show that there was PI -3 -kinase 
activity in anti-Gabl immunoprecipitates in unstimulated cells 
that increased 2.3 fold upon EGF addition. To measure the 
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overall PI-3-kinase activity of these cells, anti- 
phosphotyrosine immunoprecipi tat ions were performed. The anti- 
Gabl associated activity was 42% of that found in anti- 
phosphotyrosine immunoprecipitates. In A431 cells, the EGF 
5 receptor can transphosphorylate erbB3 and this constitutes -49% 
of the anti-phosphotyrosine associated activity indicating that 
Gabl and erbB3 are two of the major PI -3 -kinase binding 
proteins after EGF stimulation. Insulin addition produced a 
1.7 fold increase in Gabl immunoprecipitates. 3T3-L1 

10 fibroblasts were also examined for insulin stimulated activity. 
While the magnitude of the response was lower, insulin produced 
a 2.4 fold increase in activity. This was 71% of that 
associated with anti-phosphotyrosine indicating that Gabl was 
the major binding partner in these cells following insulin 

15 addition. When 3T3-L1 are differentiated into adipocytes PI-3- 
kinase activity is mainly associated with IRS-1 suggesting a 
differentiation state specific role for these two docking 
proteins . 

Both EGF and insulin induce mitogenesis and analysis 

2 0 was made to determine if overexpression of Gabl might augment 
cell growth. NIH-3T3 cells were transfected with the Gabl cDNA 
and two clones, Pilar9 and Pilarl2, were selected for further 
study. The data in Figures 4E-4G show that both cell lines had 
an enhanced growth rate and achieved a high cell density in 1% 

25 serum whereas control cells failed to proliferate. Growth 
factor addition did not significantly alter growth, except for 
Pilar9 where EGF further stimulated growth at high density. 
These data suggest that Gabl overexpression rendered NIH-3T3 
cells more sensitive to the limiting amount of growth factors 

30 present in serum. One measure of transformation is anchorage 
independent growth. As shown in Figure 4H, both the Pilar9 and 
Pilarl2 clones exhibited significant colony formation in soft 
agarose as compared to the control cells. This was dependent 
on growth factor addition indicating that activation of other 

35 proteins was necessary for transformation. Collectively, these 
data suggest that increased expression of Gabl could facilitate 
tumorigenesis by receptor protein tyrosine kinases (RPTKs) . 
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Since SOS and Grb2 can be co-precipitated with IRS-1, 
it has been suggested that IRS-1 participates in the activation 
of MAP kinase via the ras pathway. The Gabl transfected clones 
were examined to determine if they showed any enhancement of 
5 MAP kinase activation. As shown in Figure 41, control cells 
showed a large and sustained increase in MAP kinase 
phosphorylation following addition of growth factor, but the 
Pilar9 and Pilarl2 clones showed only a small and attenuated 
duration of activation. We then examined if SOS could co- 

10 precipitate with Gabl. As shown iun Figure 4J, while Grb2 was 
found in anti-SOS or ant i -Gabl immunoprecipitates , SOS could 
not be detected in anti-Gabl complexes nor could Gabl be found 
in anti-SOS complexes. The amount of Grb2 that co-precipitated 
with SOS was decreased in the Pilar9 and Pilarl2 clones, so an 

15 explanation for the decrease in MAP kinase activity is that 
Gabl competes with SOS for binding to Grb2 . While it is not 
clear if Gabl regulates SOS and the ras pathway under normal 
conditions, these results show that overexpression of Gabl does 
not enhance MAP kinase activation and that Gabl and SOS form 

20 separate complexes with Grb2. These findings are consistent 
with recent data showing that IRS-1 is not directly involved 
in MAP kinase activation by insulin. 
Example 3 

Gabl has been observed to be a substrate for the 
25 insulin-like growth factor l(IGF-l) receptor, platelet derived 
growth factor (PDGF) receptor, hepatocyte growth factor (HGF) 
receptor, TrkA receptor, IL-3 receptor, B cell receptor, and 
keratinocyte growth factor (KGF) receptor. This implicates 
Gabl in potentially a wide variety of physiologic situations. 
30 Two receptors where Gabl may play a particularly central role 
are the HGF and TrkA receptors. Specifically, following 
activation of the HGF receptor, it has been observed that Gabl 
is the major tyrosine phosphorylated substrate of this 
receptor. This is an important finding as the HGF receptor has 
35 been shown to play an essential role in the genesis of 
hepatocellular carcinoma. In development, deletion of the HGF 
receptor in a mouse knock-out model results in loss of 
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migration of certain muscle cells such as the ones that 
constitute the diaphragm and the long muscles of the body. The 
TrkA receptor is the receptor for the 2.5S form of nerve growth 
factor (NGF) . Both NGF and the TrkA receptor are required for 
5 the proper development of the sympathetic nervous system as in 
mice which have undergone a homozygous deletion for either gene 
lack sympathetic neurons. A 115 kd protein has been described 
as one of the major tyrosine phosphorylated substrates of the 
TrkA receptor. This 115 kd protein is Gabl. The activation 
10 of the enzyme PI -3 -kinase by the TrkA receptor is also 
essential for normal neuronal cell function. Gabl acts as the 
major site of recruitment of PI -3 -kinase activity in these 
cells following activation of TrkA receptor by NGF addition. 
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TGTCGGGAGT CGCGCCCGCC GCCCCTCAGC TGCCCGGCCC GGAGCCCGAG ACGCGCGCAC 120 

C ATG AGC GGT GGT GAA GTG GTC TGC TCC GGA TGG CTC CGC AAG TCC 166 
Met Ser Gly Gly Glu Val Val Cys Ser Gly Trp Leu Arg Lys Ser 
15 10 15 

CCC CCG GAG AAA AAG TTG AAG CGT TAT GCA TGG AAG AGG AGA TGG TTC 214 
Pro Pro Glu Lys Lys Leu Lys Arg Tyr Ala Trp Lys Arg Arg Trp Phe 
20 25 30 

GTG TTA CGC AGT GGC CGT TTA ACT GGA GAT CCA GAT GTT TTG GAA TAT 262 
Val Leu Arg Ser Gly Arg Leu Thr Gly Asp Pro Asp Val Leu Glu Tyr 
35 40 45 

TAC AAA AAT GAT CAT GCC AAG AAG CCT ATT CGT ATT ATT GAT TTA AAT 310 
Tyr Lys Asn Asp His Ala Lys Lys Pro He Arg He ,Ile Asp Leu Asn 
50 55 60 

TTA TGT CAA CAA GTA GAT GCT GGA TTG ACA TTT AAC AAA AAA GAG TTT 358 
Leu Cys Gin Gin Val Asp Ala Gly Leu Thr Phe Asn Lys Lys Glu Phe 
65 70 75 

GAA AAC AGC TAC ATT TTT GAT ATC AAC ACT ATT GAC CGG ATT TTC TAC 406 
Glu Asn Ser Tyr He Phe Asp He Asn Thr He Asp Arg lie Phe Tyr 
80 85 90 95 

TTG GTA GCA GAC AGC GAG GAG GAG ATG AAT AAG TGG GTT CGT TGT ATT 454 
Leu Val Ala Asp Ser Glu Glu Glu Met Asn Lys Trp Val Arg Cys He 
100 105 110 

TGT GAC ATC TGT GGG TTT AAT CCA ACA GAA GAA GAT. CCT GTG AAG CCA 502 
Cys Asp He Cys Gly Phe Asn Pro Thr Glu Glu Asp Pro Val Lys Pro 
115 120 125 

CCT GGC AGC TCT TTA CAA GCA CCA GCT GAT TTA CCT TTA GCT ATA AAT 550 
Pro Gly Ser Ser Leu Gin Ala Pro Ala Asp Leu Pro Leu Ala He Asn 
130 135 140 

ACA GCA CCA CCA TCC ACC CAG GCA GAT TCA TCC TCT GCT ACT CTA CCT 598 
Thr Ala Pro Pro Ser Thr Gin Ala Asp Ser Ser Ser Ala Thr Leu Pro 
145 150 155 

CCT CCA TAT CAG CTA ATC AAT GTT CCA CCA CAC CTG GAA ACT CTT GGC 646 
Pro Pro Tyr Gin Leu He Asn Val Pro Pro His Leu Glu Thr Leu Gly 
160 165 170 175 

ATT CAG GAG GAT CCT CAA GAC TAC CTG TTG CTC ATC AAC TGT CAA AGC 694 
He Gin Glu Asp Pro Gin Asp Tyr Leu Leu Leu He Asn Cys Gin Ser 
180 185 190 

AAG AAG CCC GAA CCC ACC AGA ACG CAT GCT GAT TCT GGA AAA TCC ACC 742 
Lys Lys Pro Glu Pro Thr Arg Thr His Ala Asp Ser Gly Lys Ser Thr 
195 200 205 

TCT TCT GAA ACA GAC TCC AAT GAT AAC GTC CCT TCT CAT AAA AAT CCT 790 
Ser Ser Glu Thr Asp Ser Asn Asp Asn Val Pro Ser His Lys Asn Pro 
210 215 220 

GCT TCC TCC CAG AGC AAA CAT GGA ATG AAT GGC TTT TTT CAG CAG CAA 838 
Ala Ser Ser Gin Ser Lys His Gly Met Asn Gly Phe Phe Gin Gin Gin 
225 230 235 

ATG ATA TAC GAC TCT CCA CCT TCA CGT GCC CCA TCT GCT TCA GTT GAC 886 
Met ll e Tyr Asp Ser Pro p ro Ser Arg Ala Pro Ser Ala Ser Val Asp 
240 245 250 255 

TCC AGC CTT TAT AAC CTG CCC AGG AGT TAT TCC CAT GAT GTT TTA CCA 934 



3NSDOC10: <WO_9707827A1_I_ 



WO 97/07827 



PCT/US96/13842 



- 49 - 

Ser Ser Leu Tyr Asn Leu Pro Arg Ser Tyr Ser His Asp Val Leu Pro 
260 265 270 

AAG GTG TCT CCA TCA AGT ACT GAA GCA GAT GGA GAA CTC TAT GTT TTT 982 
Lys Val Ser Pro Ser Ser Thr Glu Ala Asp Gly Glu Leu Tyr Val Phe 
275 280 285 

AAT ACC CCA TCT GGG ACA TCG AGT GTA GAG ACT CAA ATG AGG CAT GTA 1030 
Asn Thr Pro Ser Gly Thr Ser Ser Val Glu Thr Gin Met Arg His Val 
290 295 300 

TCT ATT AGT TAT GAC ATT CCT CCA ACA CCT GGT AAT ACT TAT CAG ATT 1078 
Ser lie Ser Tyr Asp lie Pro Pro Thr Pro Gly Asn Thr Tyr Gin He 
305 310 315 

CCA CGA ACA TTT CCA GAA GGA ACC TTG GGA CAG ACA TCA AAG CTA GAC 1126 
Pro Arg Thr Phe Pro Glu Gly Thr Leu Gly Gin Thr Ser Lys Leu Asp 
320 325 330 335 

ACT ATT CCA GAT ATT CCT CCA CCT CGG CCA CCG AAA CCA CAT CCA GCT 1174 
Thr He Pro Asp He Pro Pro Pro Arg Pro Pro Lys Pro His Pro Ala 
340 345 350 

CAT GAC CGA TCT CCT GTG GAA ACG TGT AGT ATC CCA CGC ACC GCC TCA 1222 
His Asp Arg Ser Pro Val Glu Thr Cys Ser He Pro Arg Thr Ala Ser 
355 360 365 

GAC ACT GAC AGT AGT TAC TGT ATC CCT ACA GCA GGG ATG TCG CCT TCA 12 70 

Asp Thr Asp Ser Ser Tyr Cys He Pro Thr Ala Gly Met Ser Pro Ser 
370 375 380 

CGT AGT AAT ACC ATT TCC ACT GTG GAT TTA AAC AAA TTG CGA AAA GAT 1318 
Arg Ser Asn Thr He Ser Thr Val Asp Leu Asn Lys Leu Arg Lys Asp 
385 390 395 

GCT AGT TCT CAA GAC TGC TAT GAT ATT CCA CGA GCA TTT CCA AGT GAT 1366 
Ala Ser Ser Gin Asp Cys Tyr Asp He Pro Arg Ala Phe Pro Ser Asp 
400 405 410 415 

AGA TCT AGT TCA CTT GAA GGC TTC CAT AAC CAC TTT AAA GTC AAA AAT 1414 
Arg Ser Ser Ser Leu Glu Gly Phe His Asn His Phe Lys Val Lys Asn 
420 425 430 

GTG TTG ACA GTG GGA AGT GTT TCA AGT GAA GAA CTG GAT GAA AAT TAC 146 2 

Val Leu Thr Val Gly Ser Val Ser Ser Glu Glu Leu Asp Glu Asn Tyr 
435 440 445 

GTC CCA ATG AAT CCC AAT TCA CCA CCA CGA CAA CAT TCC AGC AGT TTT 1510 
Val Pro Met Asn Pro Asn Ser Pro Pro Arg Gin His Ser Ser Ser Phe 
450 455 460 

ACA GAA CCA ATT CAG GAA GCA AAT TAT GTG CCA ATG ACT CCA GGA ACA 1558 
Thr Glu Pro He Gin Glu Ala Asn Tyr Val Pro Met Thr Pro Gly Thr 
465 470 475 

TTT GAT TTT TCC TCA TTT GGA ATG CAA GTT CCT CCT CCT GCT CAT ATG 1606 
Phe Asp Phe Ser Ser Phe Gly Met Gin Val Pro Pro Pro Ala His Met 
480 485 490 495 

GGC TTC AGG TCC AGC CCA AAA ACC CCT CCC AGA AGG CCA GTT CCT GTT 16 54 

Gly Phe Arg Ser Ser Pro Lys Thr Pro Pro Arg Arg Pro Val Pro Val 
500 505 510 

GCA GAC TGT GAA CCA CCC CCC GTG GAT AGG AAC CTC AAG CCA GAC AGA 1.702 
Ala Asp Cys Glu Pro Pro Pro Val Asp Arg Asn Leu Lys Pro Asp Arg 
515 520 525 
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AAA GTC AAG CCA GCG CCT TTA GAA ATA AAA CCT TTG CCA GAA TGG GAA 1750 
Lys Val Lys Pro Ala Pro Leu Glu lie Lys Pro Leu Pro Glu Trp Glu 
530 535 540 

GAA TTA CAA GCC CCA GTT AGA TCT CCC ATC ACT AGG AGT TTT GCT CGA 1798 
Glu Leu Gin Ala Pro Val Arg Ser Pro He Thr Arg Ser Phe Ala Arg 
545 550 555 

GAC TCT TCC AGG TTT CCC ATG TCC CCC CGA CCA GAT TCA GTG CAT AGC 1846 
Asp Ser Ser Arg Phe Pro Met Ser Pro Arg Pro Asp Ser Val His Ser 
560 565 570 575 

ACA ACT TCA AGC AGT GAC TCA CAC GAC AGT GAA GAG AAT TAT GTT CCC 1894 
Thr Thr Ser Ser Ser Asp Ser His Asp Ser Glu Glu Asn Tyr Val Pro 
580 585 590 

ATG AAC CCA AAC CTG TCC AGT GAA GAC CCA AAT CTC TTT GGC AGT AAC 1942 
Met Asn Pro Asn Leu Ser Ser Glu Asp Pro Asn Leu Phe Gly Ser Asn 
595 600 605 

AGT CTT GAT GGA GGA AGC AGC CCT ATG ATC AAG CCC AAA GGA GAC AAA 1990 
Ser Leu Asp Gly Gly Ser Ser Pro Met He Lys Pro Lys Gly Asp Lys 
610 615 620 

CAG GTG GAA TAC TTA GAT CTC GAC TTA GAT TCT GGG AAA TCC ACA CCA 2038 
Gin Val Glu Tyr Leu Asp Leu Asp Leu Asp Ser Gly Lys Ser Thr Pro 
625 630 635 

CCA CGT AAG CAA AAG AGC AGT GGC TCA GGC AGC AGT GTA GCA GAT GAG 2086 
Pro Arg Lys Gin Lys Ser Ser Gly Ser Gly Ser Ser Val Ala Asp Glu 
640 645 650 655 

AGA GTG GAT TAT GTT GTT GTT GAC CAA CAG AAG ACC TTG GCT CTA AAG 2134 
Arg Val Asp Tyr Val Val Val Asp Gin Gin Lys Thr Leu Ala Leu Lys 
660 665 670 

AGT ACC CGG GAA GCC TGG ACA GAT GGG AGA CAG TCC ACA GAA TCA GAA 2182 
Ser Thr Arg Glu Ala Trp Thr Asp Gly Arg Gin Ser Thr Glu Ser Glu 
675 680 685 

ACG CCA GCG AAG AGT GTG AAA TGAAAATATT GCCTTGCCAT TTCTGAACAA 2233 
Thr Pro Ala Lys Ser Val Lys 
690 

AAGAAAACTG AATTGTAAAG ATAAATCCCT TTTGAAGAAT GACTTGACAC TTCCACTCTA 2293 

GGTAGATCCT CAAATGAGTA GAGTTGAAGT CAAAGGACCT TTCTGACATA ATCAAGCAAT 2353 

TTAGACTTAA GTGGTGCTTT GTGGTATCTG AACAATTCAT AACATGTAAA TAATGTGGGA 2413 

AAATAGTATT GTTTAGCTCC CAGAGAAACA TTTGTTCCAC AGTTAACACA CTCG 2467 

(2) INFORMATION FOR SEQ ID NO: 2: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ser Gly Gly Glu Val Val Cys Ser Gly Trp Leu Arg Lys Ser Pro 
1 5 10 15 
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Pro Glu Lys Lys Leu Lys Arg Tyr Ala Trp Lys Arg Arg Trp Phe Val 
20 25 30 

Leu Arg Ser Gly Arg Leu Thr Gly Asp Pro Asp Val Leu Glu Tyr Tyr 
35 40 45 

Lys Asn Asp His Ala Lys Lys Pro lie Arg lie He Asp Leu Asn Leu 
50 55 60 

Cys Gin Gin Val Asp Ala Gly Leu Thr Phe Asn Lys Lys Glu Phe Glu 
65 70 75 80 

Asn Ser Tyr He Phe Asp He Asn Thr He Asp Arg He Phe Tyr Leu 
85 90 95 

Val Ala Asp Ser Glu Glu Glu Met Asn Lys Trp Val Arg Cys lie. Cys 
100 105 HO 

Asp He Cys Gly Phe Asn Pro Thr Glu Glu Asp Pro Val Lys Pro Pro 
115 120 125 

Gly Ser Ser Leu Gin Ala Pro Ala Asp Leu Pro Leu Ala He Asn Thr 
130 135 140 

Ala Pro Pro Ser Thr Gin Ala Asp Ser Ser Ser Ala Thr Leu Pro Pro 
145 150 155 160 

Pro Tyr Gin Leu He Asn Val Pro Pro His Leu Glu Thr Leu Gly He 
165 170 175 

Gin Glu Asp Pro Gin Asp Tyr Leu Leu Leu He Asn Cys Gin Ser Lys 
180 185 190 

Lys Pro Glu Pro Thr Arg Thr His Ala Asp Ser Gly Lys Ser Thr Ser 
195 200 205 

Ser Glu Thr Asp Ser Asn Asp Asn Val Pro Ser His Lys Asn Pro Ala 
210 215 220 

Ser Ser Gin Ser Lys His Gly Met Asn Gly Phe Phe Gin Gin Gin Met 
225 230 235 240 

He Tyr Asp Ser Pro Pro Ser Arg Ala Pro Ser Ala Ser Val Asp Ser 
245 250 255 

Ser Leu Tyr Asn Leu Pro Arg Ser Tyr Ser His Asp Val Leu Pro Lys 
260 265 270 

Val Ser Pro Ser Ser Thr Glu Ala Asp Gly Glu Leu Tyr Val Phe Asn 
275 280 285 

Thr Pro Ser Gly Thr Ser Ser Val Glu Thr Gin Met Arg His Val Ser 
290 295 300 

H« Ser Tyr Aso He Pro Pro Thr Pro Gly Asn Thr Tyr Gin He Pro 
305 " 310 315 320 

Arg Thr Phe Pro Glu Gly Thr Leu Gly Gin Thr Ser Lys Leu Asp Thr 
325 330 335 

He Pro Asp He Pro Pro Pro Arg Pro Pro Lys Pro His Pro Ala His 
340 345 350 

Asp Arg Ser Pro Val Glu Thr Cys Ser He Pro Arg Thr Ala Ser Asp 
355 360 365 

Thr Asp Ser Ser Tyr Cys He Pro Thr Ala Gly Met Ser Pro Ser Arg 
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370 



375 



360 



Ser Asn 
385 



Thr He Ser 



Thr Val 
390 



Asp 



Leu Asn Lys 
395 



Leu Arg 



Lys Asp Ala 
400 



Ser Ser Gin Asp Cys Tyr Asp He Pro Arg Ala Phe Pro Ser Asp Arg 
405 410 415 

Ser Ser Ser Leu Glu Gly Phe His Asn His Phe Lys Val Lys Asn Val 
420 425 430 

Leu Thr Val Gly Ser Val Ser Ser Glu Glu Leu Asp Glu Asn Tyr Val 
435 440 445 

Pro Met Asn Pro Asn Ser Pro Pro Arg Gin His Ser Ser Ser Phe Thr 
450 455 460 

Glu Pro He Gin Glu Ala Asn Tyr Val Pro Met Thr Pro Gly Thr Phe 
465 470 475 480 

Asp Phe Ser Ser Phe Gly Met Gin Val Pro Pro Pro Ala His Met Gly 
485 490 495 

Phe Arg Ser Ser Pro Lys Thr Pro Pro Arg Arg Pro Val Pro Val Ala 
500 505 510 

Asp Cys Glu Pro Pro Pro Val Asp Ara Asn Leu Lys Pro Asp Arg Lys 
515 520 " 525 

Val Lys Pro Ala Pro Leu Glu He Lys Pro Leu Pro Glu Trp Glu Glu 
530 535 540 

Leu Gin Ala Pro Val Arg Ser Pro He Thr Arg Ser Phe Ala Arg Asp 
545 550 555 560 

Ser Ser Arg Phe Pro Met Ser Pro Arg Pro Asp Ser Val His Ser Thr 
565 570 575 

Thr Ser Ser Ser Asp Ser His Asp Ser Glu Glu Asn Tyr Val Pro Met 
580 585 590 

Asn Pro Asn Leu Ser Ser Glu Asp Pro Asn Leu Phe Gly Ser Asn Ser 
595 600 605 

Leu Asp Gly Gly Ser Ser Pro Met He Lys Pro Lys Gly Asp Lys Gin 
610 615 620 

Val Glu Tyr Leu Asp Leu Asp Leu Asp Ser Gly Lys Ser Thr Pro Pro 
625 630 635 640 

Arg Lys Gin Lys Ser Ser Gly Ser Gly Ser Ser Val Ala Asp Glu Arg 
645 650 655 

Val Asp Tyr Val Val Val Asp Gin Gin Lys Thr Leu Ala Leu Lys Ser 
660 665 670 

Thr Arg Glu Ala Trp Thr Asp Gly Arg Gin Ser Thr Glu Ser Glu Thr 
675 680 685 

Pro Ala Lys Ser Val Lys 



690 



(2) INFORMATION FOR 



SEQ ID NO: 3 



(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3416 base pairs 

(B) TYPE: nucleic acid 



SNSDOCID: <WO 9707827 A 1 .!_> 



WO 97/07827 



PCT/US96/13842 



- 53 - 

(C) STRANDEDNESS : both 
<D) TOPOLOGY : both 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 335.. 2419 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTCACCGCTG GTGGTGTGGT ACCGGATCGA ATTCGGCACG AGGACCGCTG CCTAGGCGGC 60 

GGGACGGCGC GCCTGGCGGC CAGGAGGGCG CACTGAAAGA AGGTCGGCGA GCCCTGGTCC 120 

CCGCGGTTCC CGATCGAGTT CCTCTTCAGT CCGCGAATCT GCGGGAGAGG TTCGATCGCC 180 

GACACAGGGC GCGGGGAGCC GGGCCGCCCC GTCGGGGGAA TCTGAGACGT CCTCTGGGCT 24 0 

GCGTTTGACC GCCGTGCCCG CCGTGCACGG AGCGCGTCCA CTGTGTCCAC CGACCCCTTT 300 

GGTGTCTGGT CCTCGAGTCC TCACGGCGTG CACC ATG AGC GGC GGC GAA GTG 352 

Met Ser Gly Gly Glu Val 
695 700 

GTT TGC TCG GGA TGG CTC CGC AAG TCG CCC CCG GAG AAG AAG TTG AAG 400 
Val Cys Ser Gly Trp Leu Arg Lys Ser Pro Pro Glu Lys Lys Leu Lys 
705 710 715 

CGT TAT GCG TGG AAG AGA AGG TGG TTT GTG TTG CGC AGT GGC CGT TTG 44 8 

Arg Tyr Ala Trp Lys Arg Arg Trp Phe Val Leu Arg Ser Gly Arg Leu 
720 725 730 

ACT GGA GAC CCG GAT GTC CTG GAG TAT TAC AAA AAC GAT CAT GCC AAG 496 
Thr Glv Asp Pro Asp Val Leu Glu Tyr Tyr Lys Asn Asp His Ala Lys 
735 740 745 

AAG CCT ATT CGG ATT ATT GAT TTA AAT TTA TGT CAG CAA GTT GAT GCT 544 
Lys Pro He Arg He He Asp Leu Asn Leu Cys Gin Gin Val Asp Ala 
750 755 760 

GGG TTG ACA TTC AAC AAA AAG GAG TTT GAA AAC AGC TAT ATC TTT GAT 592 
Gly Leu Thr Phe Asn Lys Lys Glu Phe Glu Asn Ser Tyr He Phe Asp 
765 770 775 780 

ATC AAC ACC ATC GAC CGG ATT TTC TAC TTG GTG GCA GAT AGT GAG GAA 640 
He Asn Thr He Asp Arg He Phe Tyr Leu Val Ala Asp Ser Glu Glu 
785 790 795 

GAC ATG AAC AAG TGG GTC CGT TGT ATC TGT GAC ATC TGT GGA TTC AAT 688 
Asp Met Asn Lys Trp Val Arg Cys He Cys Asp He Cys Gly Phe Asn 
800 805 810 

CCC ACA GAA GAA GAT CC? GTG AAG CCG CTG ACT GGC TCC TCA CAA GCA 736 
Pro Thr Glu Glu Asp Pro Val Lys Pro Leu Thr Gly Ser Ser Gin Ala 
815 820 825 

CCC GTC GAT TCA CCT TTC GCT ATA AGT ACA GCA CCA GCC TCC AGT CAG 784 
Pro Val Asp Ser Pro Phe Ala He Ser Thr Ala Pro Ala Ser Ser Gin 
830 835 840 

ATG GAA GCT TCT TCA GTC GCG CTA CCT CCT CCT TAC CAG GTC ATC AGC 832 
Met Glu Ala Ser Ser Val Ala Leu Pro Pro Pro Tyr Gin Val lie Ser 
845 850 855 860 
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CTT 
Leu 


CCG 
Pro 


CCA 
Pro 


CAC 
His 


CCA 
Pro 
865 


GAC 


ACC 
Thr 


CTC 
Leu 


GGC 
Gly 


CTC 
Leu 
870 


CAG 
Gin 


GAC GAT 
Asp Asp 


CCA 
Pro 


CAA 
Gin 
875 


GAC 
Asp 


880 


TAC 
Tyr 


CTC 
Leu 


TTG 

uc u 


CTG 
Leu 
880 


ATC 
lie 


AAC 
Asn 


TGT 


CAA 
Gin 


AGC 
Ser 
885 


AAG 
Lys 


AAG 
Lys 


CCT 
Pro 


GAA 
Glu 


CCT 
Pro 
890 


AAC 
Asn 


AGA 
Ara 


928 


ACC 
Thr 


CTC 
Leu 


TTT 
Phe 
895 


GAC 
Asp 


TCT 
Ser 


GCC 
>vxa 


AAG 

Lys 


CCC 
Pro 
900 


ACC 

1 I IX 


TTT 


TCT 

OCX 


GAG ACA GAC TGC 
Glu Thr Asp Cys 
905 


AAT 
Asn 


976 


GAC GAC 
Asp Asp 
910 


GTC 
Val 


CCT 
Pro 


TCC 
Ser 


CAC 

XlXS 


CAG 
uin 
915 


ACT 
Thr 


CCT 
Pro 


GCT 

nia 


TCC 

C A V 
OCX 


TCC 
Ser 
920 


CAG 
Gin 


AGC 
Ser 


AAA 
Lys 


CAC 


1024 


GGA ATG AAT GGC TTT 
Gly Met Asn Gly Phe 
925 


TTC 
rile 
930 


CAG 


CAA 
Gin 


CAA 
Gin 


ATG 

net 


ATG 

Mot 

935 


TAT GAC TGC 
Tyr Asp Cys 


CCA 
Pro 


CCG 
Pro 
940 


1072 


TGC 
Cys 


CGG 
Arg 


CTG 
Leu 


ACA 
Thr 


TCT 
Ser 
94 5 


GTC 
va j. 


TCG 
Ser 


GGA GAG 
Gly Glu 


TCC 
ber 
950 


AGC 
Ser 


CTC 
Leu 


TAT AAC 
Tyr Asn 


CTG 
Leu 
955 


CCC 
Pro 


1120 


AGG 
Arg 


AGC 
Ser 


TAT 
Tyr 


TCC 
Ser 
960 


CAT 
His 


GAC 
Asp 


GTG 
Val 


TTG 
Leu 


CCA 
Pro 
965 


AAG 
Lys 


GAA 
Glu 


TCC 
Ser 


CCA 
Pro 


TCA 
Ser 
970 


AGC 
Ser 


ACG 
Thr 


1168 


GAG GCC GAC GGG GAG 
Glu Ala Asp Gly Glu 
975 


CTG 
Leu 


TAC 
Tyr 


ACC 
Thr 
980 


TTT 
Phe 


AAC 
Asn 


ACC 
Thr 


CCA 
Pro 


TCT GGG ACT 
Ser Gly Thr 
985 


GCA 

Ala 


1216 


GGT GTA 
Gly Val 
990 


GAG 
Glu 


ACG 
Thr 


CAG 
Gin 


ATG 
Met 


AGA 
Arg 
995 


CAT 
His 


GTA 
Val 


TCC 
Ser 


ATC 
He 


AGT TTC 
Ser Phe 
1000 


GAC 
Asp 


ATT 
He 


CCG 
Pro 


1264 


CCA ACA 
Pro Thr 
1005 


CCT 
Pro 


GGC AAC 
Gly Asn 


ACT TAC CAG 
Thr Tyr Gin 
1010 


ATC 
He 


CCA 
Pro 


CGG ACA 
Arg Thr 
1015 


TTT 
Phe 


CCA 
Pro 


GAG 
Glu 


AGC 
Ser 
1020 


1312 


ACA 
Thr 


CTG GGA CAG TCA TCA 
Leu Gly Gin Ser Ser 
1025 


AAG 
Lys 


CTG GAC 
Leu Asp 


ACC ATT 
Thr He 
1030 


CCT 
Pro 


GAT 
Asp 


ATC 
He 


CCC CCA 
Pro Pro 
1035 


1360 


CCT 
Pro 


CGG 
Arg 


CCA 
Pro 


CCA AAG 
Pro Lys 
1040 


CCA 
Pro 


CAT 
His 


CCA 
Pro 


ACT CAT 
Thr His 
1045 


GAC 
Asp 


CGG 
Arg 


TCT 
Ser 


CCT GTG 
Pro Val 
1050 


GAA 
Glu 


1408 


ACG 
Thr 


TGT 
Cys 


GGA GTC 
Gly Val 
1055 


CCA 
Pro 


CGC 
Arg 


ACG 
Thr 


GCC TCG 
Ala Ser 
1060 


GAC 
Asp 


ACT 
Thr 


GAC 
Asp 


AGC AGT 
Ser Ser 
1065 


TAC 
Tyr 


TGT 
Cys 


1456 


ATC 
IJe 


CCT CCT 
Pro Pro 
1070 


CCA 
Pro 


GCA 
Ala 


GGC 
Gly 


ATG ACG 
Met Thr 
1075 


CCC 
Pro 


TCC 
Ser 


CGG 
Arg 


AGT AAT 
Ser Asn 
1080 


ACC 
Thr 


ATT 
He 


TCC 
Ser 


1504 


ACC GTG 
Thr Val 
1065 


GAT 
Asp 


TTG 
Leu 


AAC 
Asn 


AAG TTG 
Lys Leu 
1090 


CGG 
Arg 


AAA 
Lys 


GAT 
Asp 


GCT AGT TCT CAA GAT 
Ala Ser Ser Gin Asp 
1095 


TGC 
Cys 
1100 


1552 


TAT GAT ATT 
Tyr Asp lie 


CCA CGG ACC 
Pro Arg Thr 
1105 


TTT 
Phe 


CCG 
Pro 


AGC 
Ser 


GAT^ AGA 
Asp* Arg 
1110 


TCT 
Ser 


AGT TGC 
Ser Cys 


CTG GAA. 
Leu Glu 
1115 


1600 


GGC 

Gly 


TTC 
Phe 


CAT 
His 


AGC CAG 
Ser Gin 
1120 


TAT 
Tyr 


AAA 
Lys 


ATC 
lie 


AAA AGC 
Lys Ser 
1125 


GTG 
Val 


TTG 
Leu 


ACA GCG GGA GGT 
Thr Ala Gly Gly 
1130 


1648 
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GTC TCG GGT GAA GAG CTG GAT GAG AAC TAC GTT CCC ATG AAC CCC AAC 1696 
Val Ser Gly Glu Glu Leu Asp Glu Asn Tyr Val Pro Met Asn Pro Asn 
1135 1140 1145 

TCG CCA CCT CGA CAA CAT TCC GGC AGC TTT ACC GAG CCA ATC CAG GAG 1744 
Ser Pro Pro Arg Gin His Ser Gly Ser Phe Thr Glu Pro lie Gin Glu 
1150 1155 1160 

CCA AAC TAT GTG CCA ATG ACC CCA GGG ACC TTT GAC TTT TCT TCC TTT 1792 
Pro Asn Tyr Val Pro Met Thr Pro Gly Thr Phe Asp Phe Ser Ser Phe 
1165 1170 1175 1180 

GGA ATG CAA GTC CCT CCT CCT GCT CAT ATG GGC TTC AGG TCC AGC CCA 1840 
Gly Met Gin Val Pro Pro Pro Ala His Met Gly Phe Arg Ser Ser Pro 
1185 1190 1195 

AAG ACC CCT CCC AGG AGG CCA GTT CCT GTT GCT GAC TGT GAA CCA CCC 1888 
Lys Thr Pro Pro Arg Arg Pro Val Pro Val Ala Asp Cys Glu Pro Pro 
1200 1205 1210 

CCG GTG GAT AGG AAC CTC AAG CCA GAC AGA AAA GTC AAG CCG GCA CCT 1936 
Pro Val Asp Arg Asn Leu Lys Pro Asp Arg Lys Val Lys Pro Ala Pro 
1215 1220 1225 

TTA GAC ATA AAA CCT CTG TCA GAA TGG GAA GAG CTG CAA GCC CCA GTC 1984 
Leu Asp lie Lys Pro Leu Ser Glu Trp Glu Glu Leu Gin Ala Pro Val 
1230 1235 1240 

AGA TCT CCC ATC ACC AGG AGC TTC GCT CGG GAC TCC TCT AGG TTT CCC 2032 
Arg Ser Pro He Thr Arg Ser Phe Ala Arg Asp Ser Ser Arg Phe Pro 
1245 1250 1255 1260 

ATG TCC CCT CGG CCT GAT TCT GTG CAC AGT ACG ACA TCG AGC AGC GAC 2 08 0 

Met Se- Pro Arg Pro Asp Ser Val His Ser Thr Thr Ser Ser Ser Asp 
1265 1270 1275 

TCT CAT GAC AGT GAA GAG AAC TAT GTC CCC ATG AAT CCA AAT CTG TCT 2128 
Ser His Asp Ser Glu Glu Asn Tyr Val Pro Met Asn Pro Asn Leu Ser 
1280 1285 1290 

GGC GAA GAC CCG AAT CTC TTT GCC AGC AAC AGC CTT GAT GGG GGA AGC 2176 
Gly Glu Asp Pro Asn Leu Phe Ala Ser Asn Ser Leu Asp Gly Gly Ser 
1295 1300 1305 

AGC CCG ATG AAT AAA CCC AAA GGA GAC AAA CAA GTC GAA TAC CTG GAT 2224 
Ser Pro Met Asn Lys Pro Lys Gly Asp Lys Gin Val Glu Tyr Leu Asp 
1310 1315 1320 

TTA GAC CTA GAT TCT GGG AAG TCC ACG CCA CCA CGG AAG CAA AAG AGC 2272 
Leu Asp Leu Asp Ser Gly Lys Ser Thr Pro Pro Arg Lys Gin Lys Ser 
1325 1330 1335 1340 

AGT GGT TCT GGC AGC AGC ATG GCA GAC GAG AGG GTG GAT TAC GTT GTG 2320 
Se^ Gly Ser Glv Ser Ser Met Ala Asp Glu Arg Val Asp Tyr Val Val 
* 1345 1350 1355 

GTG GAC CAA CAG AAG ACT CTG GCC CTG AAG AGT ACC AGA GAA GCT TGG 236 8 

Val Asp Gin Gin Lys Thr Leu Ala Leu Lys Ser Thr Arg Glu Ala Trp 
1360 1365 1370 

ACG GAT GGG AGG CAG TCC ACA GAG TCC GAG ACA CCC ACC AAG AAT GTG 2416 
Thr Asp Gly Arg Gin Ser Thr Glu Ser Glu Thr Pro Thr Lys Asn vai 
1375 1380 1385 

AAG TGAAGACATG CCGTCGCCTC TGCCGGCAGA CGAGATCTGA GTTGGAAAGA 246 9 

Lys 
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GAGATGGCCA 


AGTGAAGATG 


TTCCCACTCT 


CAGTGGGAGC 


CTCGAGCCAG 


CAGGGGCAGA 


2529 


GAGGAAGGAT 


CTCTCACACA 


TGTTCAAGCA 


AATTTAGGTT 


GTGAATTGGT 


GCTGTGTGGT 


2589 


ATTGGATTTA 


TAACGTGTAA ATAACCCGGG 


GAAATAGTGT 


TTTTAGTTCA 


CAGAGAAGCT 


2649 


TCTGTCCCTA 


ATTAACACAC 


CTGTAGTATT 


ACTATACTGA 


TGCACTTTTC 


ATTTAAAACC 


2709 


TTGGTTTGGG 


TCTTCCCGAT 


CTACCTTAAC 


AGACTTTCCT 


TGGGAGGTCT 


TTTGGCCTCC 


2769 


TCACACTACT 


CTATATAACA 


ATACTAAGTG 


ACTGAGCTAC 


TTGTAATTCT 


GGAAATTCCA 


2829 


GTTGAAGCTA 


CAGGGCTAAC 


ACCATTAAAA 


CGAGAAGTGA 


AGTTGACACA 


TTCGCTTTTC 


2889 


TCTTGAAGGT 


GGTAGCCATT 


AGCTTAAGCT 


GTAGAACATA 


GTTGGACTTG 


TCCTTCGTTG 


2949 


TTTTCCCAAA 


AATTCCGGGG 


ATATTGTATA 


TAGCAGGTCA 


AGACCTAGCT 


CTCTGACTCA 


3009 


TGTACACTTA 


GGTTTTAACT 


GTAGGACTTT 


GTTATTAATA 


TTTTTTTTGT 


TAATGACAGT 


3069 


GTTGGGTTCA 


TCGTGTGAAG 


GTTCTGCTGG 


GTAGGATCTT 


GCACCTTTCA 


AAGACTGCCT 


3129 


CTTAGTTACA 


CTAGTAAGCC 


CCCGAATCAT 


CCACAGCATG 


GACTGCTGGC 


CTGCTCTTAC 


3189 


TCCTGTTTAT 


GTGTTAAACA 


TTATCTGCGA 


AAGGCAGATT 


ATACGACTGA 


CCGATCAGGT 


3249 


ACGTACAAGG 


CACTGATGTG 


CTAATACAGT 


GATTGGGTCA 


GACAAAGTGC 


TTCAGTTAGT 


3309 


GTGCGTTCGT 


CCTAATCTTG 


GTTTAGAATT 


AATGAAACAG 


TTGGCGTTCA 


CTGTCAGCAG 


3369 


CATAGTGTGA 


TTTTGAATGA 


ATTAGGCAGG 


AATTCAAGAT 


TACTACT 




3416 



(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ser Gly Gly Glu Val Val Cys Ser Gly Trp Leu Arg Lys Ser Pro 
15 10 15 

Pro Glu Lys Lys Leu Lys Arg Tyr Ala Trp Lys Arg Arg Trp Phe Val 
20 25 30 

Leu Arg Ser Gly Arg Leu Thr Gly Asp Pro Asp Val Leu Glu Tyr, Tyr 
35 40 45 

Lys Asn Asp His Ala Lys Lys Pro lie Arg lie lie Asp Leu Asn Leu 
50 55 60 

Cys Gin Gin val Asp Ala Gly Leu Thr Phe Asn Lys Lys Glu Phe Glu 
65 70 75 80 

Asn Ser Tyr He Phe Asp He Asn Thr He Asp Arg He Phe Tyr Leu 
85 90 95 

Val Ala Asp Ser Glu Glu Asp Met Asn Lys Trp Val Arg Cys He Cys 
100 105 no 

Asp He Cys Gly Phe Asn Pro Thr Glu Glu Asp Pro Val Lys Pro Leu 
115 120 n-pc 
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Thr Gly Ser Ser Gin Ala Pro Val Asp Ser Pro Phe Ala lie Ser Thr 
130 135 140 

Ala Pro Ala Ser Ser Gin Met Glu Ala Ser Ser Val Ala Leu Pro Pro 
145 150 155 160 

Pro Tyr Gin Val He Ser Leu Pro Pro His Pro Asp Thr Leu Gly Leu 
165 170 175 

Gin Asp Asp Pro Gin Asp Tyr Leu Leu Leu He Asn Cys Gin Ser Lys 
180 185 190 

Lys Pro Glu Pro Asn Arg Thr Leu Phe Asp Ser Ala Lys Pro Thr Phe 
195 200 205 

Ser Glu Thr Asp Cys Asn Asp Asp Val Pro Ser His Gin Thr Pro Ala 
210 215 220 

Ser Ser Gin Ser Lys His Gly Met Asn Gly Phe Phe Gin Gin Gin Met 
225 230 235 240 

Met Tyr Asp Cys Pro Pro Cys Arg Leu Thr Ser Val Ser Gly Glu Ser 
245 250 255 

Ser Leu Tyr Asn Leu Pro Arg Ser Tyr Ser His Asp Val Leu Pro Lys 
260 265 270 

Glu Ser Pro Ser Ser Thr Glu Ala Asp Gly Glu Leu Tyr Thr Phe Asn 
275 280 285 

Thr Pro Ser Gly Thr Ala Gly Val Glu Thr Gin Met Arg His Val Ser 
290 295 300 

He Ser Phe Asp He Pro Pro Thr Pro Gly Asn Thr Tyr Gin He Pro 
305 310 315 320 

Arg Thr Phe Pro Glu Ser Thr Leu Gly Gin Ser Ser Lys Leu Asp Thr 
325 330 335 

He Pro Asp He Pro Pro Pro Arg Pro Pro Lys Pro His Pro Thr His 
340 345 350 

Asp Arg Ser Pro Val Glu Thr Cys Gly Val Pro Arg Thr Ala Ser Asp 
355 360 365 

Thr Asp Ser Ser Tyr Cys He Pro Pro Pro Ala Gly Met Thr Pro Ser 
370 375 380 

Arg Ser Asn Thr He Ser Thr Val Asp Leu Asn Lys Leu Arg Lys Asp 
385 390 395 400 

Ala Ser Ser Gin Asp Cys Tyr Asp He Pro Arg Thr Phe Pro Ser Asp 
405 410 415 

Arg Ser Ser Cys Leu Glu Gly Phe His Ser Gin Tyr Lys He Lys Ser 
420 425 430 

Val Leu Thr Ala Gly Gly Val Ser Gly Glu Glu Leu Asp Glu Asn Tyr 
435 440 445 

Val Pro Met Asn Pro Asn Ser Pro Pro Arg Gin His Ser Gly Ser Phe 
450 455 46 ° 

Thr Glu Pro He Gin Glu Pro Asn Tyr Val Pro Met Thr Pro Gly Thr 
465 470 475 480 

Phe Asp Phe Ser Ser Phe Gly Met Gin Val Pro Pro pro Ala His Met 
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485 490 495 

Gly Phe Arg Ser Ser Pro Lys Thr Pro Pro Arg Arg Pro Val Pro Val 
500 505 510 

Ala Asp Cys Glu Pro Pro Pro Val Asp Arg Asn Leu Lys Pro Asp Arg 
515 520 525 

Lys Val Lys Pro Ala Pro Leu Asp He Lys Pro Leu Ser Glu Trp Glu 
530 535 540 

Glu Leu Gin Ala Pro Val Arg Ser Pro He Thr Arg Ser Phe Ala Arg 
545 550 555 560 

Asp Ser Ser Arg Phe Pro Met Ser Pro Arg Pro Asp Ser Val His Ser 
565 570 575 

Thr Thr Ser Ser Ser Asp Ser His Asp Ser Glu Glu Asn Tyr Val Pro 
580 585 590 

Met Asn Pro Asn Leu Ser Gly Glu Asp Pro Asn Leu Phe Ala Ser Asn 
595 600 605 

Ser Leu Asp Gly Gly Ser Ser Pro Met Asn Lys Pro Lys Gly Asp Lys 
610 615 620 

Gin Val Glu Tyr Leu Asp Leu Asp Leu Asp Ser Gly Lys Ser Thr Pro 
625 630 635 640 

Pro Arg Lys Gin Lys Ser Ser Gly Ser Gly Ser Ser Met Ala Asp Glu 
645 650 655 

Arg Val Asp Tyr Val Val Val Asp Gin Gin Lys Thr Leu Ala Leu Lys 
660 665 670 

Ser Thr Arg Glu Ala Trp Thr Asp Gly Arg Gin Ser Thr Glu Ser Glu 
675 680 685 

Thr Pro Thr Lys Asn Val Lys 
690 695 
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CLAIMS 

1. a substantially pure protein having the amino acid 
sequence of SEQ ID NO:2. 

2. The protein of claim 1 wherein said protein has the 
5 amino acid sequence of SEQ ID NO: 2. 

3. An isolated nucleic acid molecule that comprises a 
nucleic acid sequence that encodes the protein of claim 1. 

4. A pharmaceutical composition comprising the nucleic 
acid molecule of claim 3 and a pharmaceutical^ acceptable 

10 carrier. 

5. An isolated nucleic acid molecule consisting of SEQ 
ID N0:1 or a fragment thereof having at least 10 nucleotides. 

6. The nucleic acid molecule of claim 5 consisting of SEQ 
ID NO:l. 

15 7. a recombinant expression vector comprising the nucleic 

acid molecule of claim 6. 

8. A host cell comprising the recombinant expression 
vector of claim 7. 

9. The nucleic acid molecule of claim 5 consisting of a 
20 fragment of SEQ ID NO:l having at least 10 nucleotides. 

10. The nucleic acid molecule of claim 5 consisting of a 
fragment of SEQ ID NO:l having 12-150 nucleotides. 

11. The nucleic acid molecule of claim 5 consisting of a 
fragment of SEQ ID N0:1 having 15-50 nucleotides. 



BNSDOCID: <WO 9707B27A1 J_> 



WO 97/07827 



PCT/US96/13842 



- 60 - 

12. An oligonucleotide molecule comprising a nucleotide 
sequence complimentary to a nucleotide sequence of at least 5 
nucleotides of SEQ ID N0:1. 

13. The oligonucleotide molecule of claim 12 wherein said 
5 oligonucleotide molecule comprises a nucleotide sequence 

complimentary to a nucleotide sequence of 5-50 nucleotides of 
SEQ ID NO:l. 

14. The oligonucleotide molecule of claim 12 wherein said 
oligonucleotide molecule comprises a nucleotide sequence 

10 complimentary to a nucleotide sequence of 10-40 nucleotides of 
SEQ ID N0:1. 

15. The oligonucleotide molecule of claim 12 wherein said 
oligonucleotide molecule comprises a nucleotide sequence 
complimentary to a nucleotide sequence of 15-25 nucleotides of 

15 SEQ ID N0:1. 

16. The oligonucleotide molecule of claim 12 consisting 
of a nucleotide sequence complimentary to a nucleotide sequence 
of at least 10-150 nucleotides of SEQ ID N0:1. 

17. The oligonucleotide molecule of claim 12 consisting 
20 of a nucleotide sequence complimentary to a nucleotide sequence 

of at least 18-28 nucleotides of SEQ ID N0:1. 

18. An isolated antibody which binds to an epitope on SEQ 
ID NO: 2. 

19. The antibody of claim 18 wherein said antibody is a 
25 monoclonal antibody. 

20. a method of identifying modulators of Gabl binding to 
Grb2, PLC-Y, PI-3-kinase, SKPTP2/syp, Phosphatidylinositol , 
4, 5-bisphosphate or the /3y subunit of heterotrimeric G protein 
comprising the steps of: 
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performing a test assay by contacting Gabl with Grb2 , 
PLC-7, PI-3-kinase, SHPTP2/syp, Phosphatidylinositol , 4,5- 
bisphosphate or the j3y subunit of heterotrimeric G protein in 
the presence of a test compound under conditions in which said 
5 Gabl would bind to Grb2, PLC-y, PI-3-kinase, SHPTP2/syp, 
Phosphatidylinositol, 4 , 5-bisphosphate (PIP 2 ) or the fty subunit 
of heterotrimeric G protein in the absence of said test 
compound, 

determining whether said Gabl binds to Grb2, PLC-y, 
10 PI-3 -kinase, SHPTP2/syp, Phosphatidylinositol, 4 , 5-bisphosphate 
(PIP 2 ) or the (3y subunit of heterotrimeric G protein. 

21. A method of identifying inhibitors of tyrosine or 
serine/threonine kinase activity on Gabl comprising the steps 
of: 

15 performing a test assay by contacting Gabl with 

tyrosine kinase or serine threonine kinase in the presence of 
a phosphorus donor and a test compound under conditions in 
which said Gabl would be phosphorylated in the absence of said 
test compound, 

20 comparing the level of phosphorylated Gabl to 

unphosphorylated Gabl in the presence of the test compound to 
the level of phosphorylated Gabl to unphosphorylated Gabl in 
the absence of the compound. 

22. A method of inhibiting expression of Gabl comprising 
25 the step of : 

contacting cells that express Gabl with a nucleic acid 
molecule that comprises oligonucleotide molecule that comprises 
a nucleotide sequence complimentary to a nucleotide sequence 
of 5-50 nucleotides of SEQ ID NO:l. 

30 23. A method of identifying compounds that modulate PI-3 

kinase protein binding to Gabl protein comprising the steps of : 
comparing the level of Gabl binding t:o PI-3 kinase in 
cells in which Gab-1 is phosphorylated and in the presence of 
a test compound to the level of Gabl binding to PI -3 kinase in 
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cells in which Gab-1 is phosphorylated and in the absence of 
the test compound 
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FIGURE 1 
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